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DNAs and proteins or peptides specific to bacteria of the 
species Neisseria, meningitidis, processes for obtaining them 
and their biological uses. 

5 The invention relates to DNAs and to proteins and 

peptides which are specific to bacteria of the species 
Neisseria meningitidis (abbreviated below to Nm) , to the 

process for obtaining them and to their biological uses, in 
particular for the prevention and detection of meningococcal 
10 infections and meningitis. 

It is known that Nm is one of the main agents of 
cerebrospinal meningitis . 

Studies conducted in the United States have shown that 5 
to 10% of the population are asymptomatic carriers of the Nm 
pi 5 strain (s). The transmission factors of Nm are poorly known. 

For a proportion of persons infected, Nm penetrates the 
|]j bloodstream, where it can cause meningococcaemia and/or 
iJ £ progress to the cerebrospinal stream, to cause meningitis. 
l]-* Without fast antibiotic treatment, the infection can develop 
5; l20 like lightning and become fatal. 

r ^ Compared with other pathogens, Nm has the characteristic 

OJ of being able to cross the haemato- encephalic barrier to 
*12 colonize the meninges. The study of the pathogenicity of Nm is 
.p therefore important not only in the context of meningitis, but 
1*25 also in the context of any disease which affects the brain. 

The benefit of having available tools specific to this 

species of bacteria for the uses envisaged above is therefore 

understood. 

Genetically, Nm is very close to bacteria of the species 
30 Neisseria gonorrhoeae (abbreviated to Ng below) and of the 
species Neisseria lactamica (abbreviated to Nl below) . 
However, their pathogenicity is very different. 
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Nm colonizes the nasopharynx, and then crosses the 
pharyngeal epithelium to invade the submucous space, thus 
being responsible for septicaemia and meningitis. 

Ng is especially responsible for infections located in 
the genitourinary tract. It colonizes the genital mucosa, and 
then crosses the epithelium, subsequently invading the 
subepithelium, where it multiplies and is responsible for a 
severe inflammatory reaction. Disseminated gonococcal 
infections are possible, but remain rare and are the result of 
only some strains . 

As regards Nl, it is considered that this is a non-pathogenic 
strain, since it is not responsible for a localized or general 
invasion. 

A first consideration thus led to taking into account the 
fact that Nm and Ng, while being bacteria very close to one 
another, have different pathogenic potencies. 

Since the genome of these bacteria has a high homology, 
only limited parts of the genome of Nm and Ng must code for 
specific virulence factors responsible for their pathogenesis. 

It is clear that Nm has, compared with Ng, DNA sequences 
which are specific to it and which must be involved in the 
expression of its specific pathogenic potency. 

The species Nm is subdivided into serogroups based on the 
nature of the capsular polysaccharides . 

At least 13 serogroups have been defined, among which 
serogroups A, B and C are responsible for about 90% of 
meningitis cases. Groups A and C are found in epidemic forms 
of the disease. Group B is the serogroup generally isolated 
the most in Europe and the United States. 

The capsule, which is present in Nm and absent from Ng, 
has served as the basis for formulating meningococcal 
antimeningitis vaccines. 
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The polysaccharides of the Nm capsule have been used to 
formulate a vaccine which has proved to be effective in 
preventing in adults the meningitis caused by meningococci of 
serogroups A, C, W135 and Y. 
5 However, the polysaccharide of Nm group C has proved to 

be weakly immunogenic in children of less than two years, 
while the polysaccharide of Nm group B is non- immunogenic in 
man and shares epitopes with adhesion glycoproteins present in 
human neuronal cells. 
10 There is therefore no universal vaccine capable of 

preventing infections caused by all the serogroups of the 
meningococci and capable of responding to the intrinsic 
antigenic variability of bacterial pathogens in general and Nm 
in particular. 

15 Because of the cross-reactivity of the Nm group B 

polysaccharide with human antigens, the multiplicity of the 
serogroups and the antigenic variability of Nm, the strategies 
1JJ proposed to date cannot lead to a vaccine which is effective 
£ in all situations. 

j : |o Research is therefore concentrated on study of the 

* characteristic elements responsible for the specificity of the 

Q meningococcal pathogenesis. 

lI. The majority of genes which have been studied in either 

42 of the two bacteria Nm or Ng have their homologue in the 
f : 25 second bacterium. 

In the same way, the majority of virulence factors 
identified to date in Nm have a counterpart in Ng, that is to 
say pilin, the PilC proteins, the opacity proteins and the 
receptors of lactoferrin and transferrin. 
3 0 The specific attributes of meningococci characterized in 

the prior art are the capsule, the Frp proteins analogous to 
RTX toxins, Opc proteins of the external member, glutathione 
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peroxidase, the porin PorA and the rotamase gene. 

Among these, only the capsule is invariably present in 
the virulent strains of Nm. However, several extracellular 
pathogens have a capsule without nevertheless crossing the 
5 haemato-encephalic barrier. 

Attributes which have not yet been identified must 
therefore be responsible for the specificity of the 
meningococcal pathogenesis. These attributes are probably 
coded by DNA sequences present among the meningococci but 
10 absent from the gonococci. 

The inventors have developed a new approach based on 
subtractive isolation of Nm- specific genes, which genes must 
be linked to the specific pathogenesis of Nm, and more 
particularly to crossing of the haemato- encephalic barrier. 
: . )T 15 The subtractive method developed in the prior art has 

|] resulted in the production of epidemological [sic] markers for 
^ some Nm isolates. These markers are of limited use: they do 
?n not cover all the serogroups of the Nm species. 

4* In contrast to these studies, the work of the inventors 

!'20 has led, by confronting Nm with the entire Ng chromosome 
t sheared in a random manner, to the development of a means for 

y cloning all the DNAs present in Nm and absent from Ng, thus 
providing tools of high specificity with respect to Nm, and 
;p thus enabling the genetic variability of the species to be 
rks responded to for the first time. 

The terms "present" and absent" used in the description 
and claims in relation to the DNAs of a strain or their 
expression products are interpreted on the basis of identical 
hybridization conditions (16 h at 65 °C, with NaP0 4 0.5 M, pH 
3 0 7.2; EDTA-Na 0.001 M, 1%, 1% bovine serum albumin and 7% 
sodium dodecyl sulphate) using the same probe and the same 
labelling intensity of the probe, the same amount of 
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chromosomal DNA and the same comparison element (chromosomal 
DNA of the homologous strain) . 

It is therefore considered that the DNA is present if the 
signal obtained with the probe is practically the same as that 
5 obtained with the reference strain. 

Conversely, it is considered that the DNA is absent if 
this signal appears very weak. 

A second consideration of the pathogenicities of Nm and 
Ng leads to taking into account their common capacity for 
10 colonization and penetration of the mucosa, and then invasion 
of the subepithelial space of the latter. It is highly 
probable that this process involves virulence factors common 
to the two pathogens. In this respect, it is known that a 
certain number of virulence factors have already been 
.,15 identified in Nm and in Ng, such as the pili proteins, Pile, 
"J*? the opacity proteins, the IgA proteases, the proteins for 
*Q binding to transferrin and to lactoferrin, and the 
"Ci lipooligosaccharides . 

4« The approach of the inventors is thus extended to 

^ko investigation of the Nm regions which are specific to Nm and 
; Ng but absent from the non-pathogenic species Nl, and in a 

general manner to investigation of the chromosomal regions of 
^7. the DNAs and their expression products specific to a given 
42 species by the means developed in accordance with the 
H25 invention. 

The object of the invention is thus to provide DNAs of Nm 

specific to its pathogenic potency and means for obtaining 

them, in particular by formulating banks formed exclusively 

from these Nm-specific DNAs. 
3 0 It also provides the products derived from these DNA 

sequences . 

The invention also rebates to the utilization of specific 
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and exhaustive characteristics of these banks to formulate 
tools which can be used, in particular, in diagnostics, 
treatment and prevention. 

The DNAs of the invention are characterized in that they 
are in all or part genes, with their reading frame, present in 
Neisseria meningitidis, but absent both from Neisseria 
gonorrhoeae and from Neisseria lactamica, with the exception 

of genes involved in the biosynthesis of the polysaccharide 
capsule, frpA, frpC, opc, por A, rotamase, the sequence 
IS1106, IgA proteases, pilin, pile, proteins which bind 
transferrin and opacity proteins. 

As stated above, the terms "present" and "absent" are 
interpreted on the basis of the hybridization conditions used 
in the Southern blotting described in the examples and 
referred to above . 

It can be seen that these DNAs include variants where 
these express a function intrinsic to the Nm species, more 
particularly a phenotype found solely in Nm or in common 
exclusively with Ng. 

According to a main aspect, these DNAs are specific to 
the pathogenicity of Neisseria meningitidis, in spite of the 

genetic variability of this species. 

According to an embodiment of the invention, the said 
DNAs are specific to Nm, in contrast to Ng. 

More particularly, the Nm-specific DNAs are absent from 
Neisseria lactamica and from Neisseria cinerea* 

Surprisingly, the majority of genetic differences between 
the strains of meningococci and those of gonococci appear 
grouped in distinct regions, which are said to correspond to 
the pathogenicity islets described previously for E. coli and 
Y. pestis. 

In a preferred embodiment of the invention, these DNA are 
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thus also charact erized in that they comprise one or more 
sequence (s) present on the chromosome of Neisseria 
meningitidis Z2491 between tufA and pilT, or region 1 of the 

chromosome, and/or the sequence (s) capable of hybridizing with 
the above sequence (s) , with the proviso of being specific to 
Neisseria meningitidis . 

"Specific" in the description and the claims means the 
nucleotide sequences which hybridize only with those of Nm 
under the hybridization conditions given in the examples and 
referred to above. 

In this respect, it can be seen that, in a general 
manner, when "all or part" of a sequence is referred to in the 
description and claims, this expression must be interpreted 
with respect to the specificity defined above. 

Furthermore, all or part of a peptide or a fragment of a 
peptide or an antibody means a product having the biological 
properties respectively of the natural peptide or the antibody 
formed against the peptide. 

Genes of the Neisseria meningitidis capsule are grouped 
in region 1 . 

DNAs of this type have a sequence corresponding in all or 
part to SEQ ID No. 9, 13, 22 or 30, and/or to any sequence 
located at more or less 2 0 kb from these SEQ ID on the 
chromosome of an Nm strain, and/or have a sequence which is 
capable of hybridizing with at least a fragment of any one of 
these sequences . 

In another preferred embodiment of the invention, these 
DNA are also characterized in that they are made up of one or 
more sequence (s) present 'on the chromosome of Neisseria 

meningitidis Z2491 between pilQ and A,740, or region 2 of the 

chromosome, and/or the sequences (s) capable of hybridizing 
with the above sequence (s) , with the proviso of being specific 
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to Neisseria meningitidis . 

DNAs according to this embodiment have a sequence 
corresponding in all or part to SEQ ID No. 1, 2, 4, 6, 7 , 10, 
15, 31 or 34, and/or to any sequence located at more or less 
2 0 kb from these SEQ ID on the chromosome of an Nm strain, 
and/or have a sequence which is capable of hybridizing with at 
least a fragment of any one of these sequences . 

The invention especially provides all or part of the DNA 
sequence SEQ ID No. 36 of 15,620 bp, and the sequences 
corresponding to the open reading frames SEQ ID No. 37, SEQ ID 
No. 38, SEQ ID No. 39, SEQ ID No . 40, SEQ ID No. 41, SEQ ID 
No. 42, SEQ ID No. 43, SEQ ID No. 44 and SEQ ID No. 45. 

In yet another preferred embodiment of the invention, 
these DNAs are also characterized in that they are made up of 
one or more sequence (s) present on the chromosome of Neisseria, 
meningitidis Z24 91 between argrF and opaB, or region 3 of the 
chromosome, and/or the sequence (s) capable of hybridizing with 
the above sequence (s) , with the proviso of being specific to 
Neisseria meningitidis . 

DNAs according to this embodiment are characterized in 
that they have a sequence corresponding in all or part to SEQ 
ID No. 8, 21, 23, 25, 26, 28, 29, 32 or 35, and/or to any 
sequence located at more or less 2 0 kb from these SEQ ID on 
the chromosome of an Nm strain, and/or have a sequence which 
is capable of. hybridizing with at least a fragment of any one 
of these sequences . 

Regions 1, 2 and 3 identified above have a high 
proportion of sequences specific to Neisseria meningitidis and 
also fall within the context of the invention. 

Other DNAs representative of the specificity with respect 
to Neisseria meningitidis have one or more sequences which 

is/are present on the chromosome of Neisseria meningitidis 
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Z24 91 but are not part of regions 1, 2 and 3 defined above. 

Such DNAs comprise one or more sequence (s) corresponding 
in all or part to SEQ ID No. 3, 5, 11, 12, 14, 16, 18, 19, 20, 
24, 27 or 33, and/or to any sequence located at more or less 
5 2 0 kb from these SEQ ID on the chromosome of an Nm strain, 
and/or have a sequence capable of hybridizing with such 
sequences . 

Taking into account the uses envisaged in particular, the 
invention more specifically relates to the above DNAs involved 
10 in the pathogenesis of the bacterial organism. 

In particular, it provides the DNAs corresponding to at 
least one of the characterizations given above and coding for 
a protein exported beyond the cytoplasmic membrane, and/or of 
which all or part of their sequence corresponds to the 
conserved region of the said DNAs. 
^ According to another embodiment of the invention, the 

DNAs are thus common with those of Ng, but are absent from 
pi those of Nl . 

% These are more specifically the DNAs which are present on 

s£0 region 4 (arg J to reg F) or on region 5 (lambda 3 75 marker to 
t_ pen A) on the chromosome of Nm Z2491 and/or are capable of 
^ hybridizing with the said DNAs present, with the proviso of 
K being specific to Nm and Ng, in contrast to Nl . 

»l "Specific to Nm and Ng in contrast to Nl" means the DNAs 

*,25 which hybridize with the DNAs of Nm and Ng under the 

hybridization conditions of the examples (see example 4 in 

particular) . 

The DNAs of regions 4 and 5 and those capable of 
hybridizing with these DNAs, with the proviso of expressing 
3 0 the intrinsic functions of Nm, have the advantage of 
intervening in a significant manner in the virulence of Nm, 
being involved in the stage of initial colonization and 
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penetration and in the septicaemic dissemination. 

According to other embodiments, the invention provides 
transfer and expression vectors, such as plasmids, cosmids or 
bacteriophages, comprising at least one DNA as defined above. 

It also provides host cells transformed by at least one 
DNA as defined above . 

Other host cells of the invention comprise genes or gene 
fragments specific to Nm, and are characterized in that their 
chromosome is deleted by at least one DNA according to the 
invention, in particular a DNA responsible for the 
pathogenicity. They are more specifically bacterial cells, in 
particular of Nm. 

The invention also relates to the RNAs of which the 
sequence corresponds in all or part to the transcription of at 
least one DNA sequence or sequence fragment as defined above. 

The invention also relates to the antisense nucleic acids 
of the DNAs as defined above, or of fragments of these DNAs . 

These antisense nucleic acids carry, where appropriate, 
at least one substituent, such as a methyl group and/or a 
glycosyl group . 

Other products which fall within the context of the 
invention include polypeptides. 

These polypeptides are characterized in that they have an 
amino acid chain corresponding to all or part of a sequence 
coded by the nucleic acids defined above, or deduced from 
sequences of these nucleic acids. 

They are advantageously polypeptides corresponding to all 
or part of the polypeptides exported beyond the cytoplasmic 
membrane, more specifically polypeptides corresponding to all 
or part of those coded by a conserved region. 

As a variant, the polypeptides of the invention can be 
modified with respect to those corresponding to the nucleic 
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acid sequences such that they are particularly suitable for a 
given use, in particular use as a vaccine. 

Modification is understood as meaning any alteration, 
deletion or chemical substitution where this does not affect 
the biochemical properties of the corresponding natural 
polypeptides, more specifically of functional proteins 
exported at the periplasm and the external membrane. 

Other products according to the invention include 
antibodies directed against the above polypeptides. 

The invention thus provides polyclonal antibodies, and 
also monoclonal antibodies, characterized in that they 
recognize at least one epitope of a polypeptide as described 
above . 

It also relates to fragments of these antibodies, more 
particularly the fragments Fv, Fab and Fab' 2. 

The invention also relates to the anti-antibodies which 
are capable of recognizing the antibodies defined above, or 
their fragments, by a reaction of the antigen-antibody type. 

According to the invention, the various products 
considered above are obtained by a synthesis and/or biological 
route in accordance with conventional techniques. 

The nucleic acids can also be obtained from banks made up 
of Nm-specific DNAs such as are formulated by a subtractive 
technique, this technique comprising: 

- mixing of two DNA populations, 

- realization of at least one subtractive hybridization- 
amplification iteration, and 

- collection of the desired DNA or DNAs, followed, where 
appropriate, by its/their purification with elimination of 
redundant sequences . 

According to the invention, the two DNA populations 
originate respectively from a strain of Neisseria. 
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meningitidis , the so-called reference strain for which the 
specific bank must be constructed, and a strain of Neisseria., 
the so-called subtraction strain, having a homology in primary 
DNA sequences of greater than about 70% with the Neisseria, 
meningitidis strain, the DNA sequences of the subtraction and 

reference strains being obtained respectively by random 
shearing, and by cleavage by a restriction endonuclease 
capable of producing fragments less than about 1 kb in size. 

The invention provides in particular a process for 
obtaining Neisseria meningitidis -specific DNA banks, 
comprising the stages of 

- random shearing of the chromosomal DNA of a strain of 
Neisseria gonorrhoeae, the so-called subtraction strain, in 

particular by repeated passage through a syringe, 

cleavage of the chromosomal DNA of a strain of 
Neisseria meningitidis , the so-called reference strain, 

preferably by a restriction enzyme producing fragments less 
than about 1 kb in size, 

- splicing of the DNA fragments of the reference strain, 
cleaved by the restriction enzyme, with suitable 
oligonucleotide primers, 

realization of a subtractive hybridization- 
amplification iteration, by: 

mixing of the two DNA populations under suitable 
conditions for hybridization of homologous sequences, and then 
amplification of auto-reannealed fragments and 
collection of these fragments, 

digestion of these fragments by a restriction enzyme 
and re-splicing with oligonucleotide primers, followed by a 

- purification of the spliced DNA and, where appropriate, 
a new iteration of the subtractive hybridization, comprising 
mixing of DNA fragments of Neisseria gonorrhoeae sheared as 
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indicated above with DNA fragments of Neisseria meningitidis 

produced by the preceding iteration, followed, if desired, by- 
cloning of the DNAs of the bank. 

The primers used are oligodeoxynucleotide primers which 
5 are suitable for the restriction endonuclease used and allow 
insertion into a cloning site, such as the EcoRI site of the 
plasmid pBluescript . Such primers will advantageously be 
chosen among the oligodeoxynucleo tides referred to in the 
sequence listing under SEQ ID no. 36 to 45. 

10 The banks thus obtained are formed from DNAs which are 

specific to meningococci and absent from gonococci. 

The specificity of the DNAs was verified, as described in 
the examples, at each iteration by Southern blots, with genes 
common to the subtraction strain and to the reference strain, 

J_5 or with the total DNA of each of the strains digested by a 

3 restriction endonuclease, such as Clal . 

j At each iteration, the exhaustivity of the DNA bank was 

1 also verified by Southern blotting with probes known to be 
Lj specific to the reference strain, that is to say for Neisseria 

2 0 meningitidis the frp, opc and rotamase genes in particular. 

I The experiments carried out showed that the banks 

f. obtained by the process of the invention are deficient in 

z genes having a significant homology with species of Neisseria 

j other than Neisseria meningitidis , for example the ppk or 

25 pilCl genes, generally in only 2 or 3 iterations. 

If necessary, two routes, which are not exclusive of each 
other, can be taken. 

It is possible to proceed with an (n+l) th iteration using 
the DNA of iteration n as the DNA population of the reference 

3 0 strain. 

As a variant, a second bank independent of the first is 
constructed, with a restriction enzyme of different 
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specificity to that used in the first bank, for example Mbol • 

In all cases, it is preferable to keep each of the 
products produced by each of the iterations performed. 

The invention also provides the use of the subtractive 
technique described above to obtain banks of the DNAs common 
to Nm and Ng, but specific with respect to Nl . 

Three different banks are advantageously constructed, two 
of them by digestion of the chromosomal DNA of Nm by Mbol and 

Tsp5091, and the third by digestion of the chromosomal DNA of 

Nm with Mspl. Two subtraction series allow the DNAs having the 

required specificity to be collected, as described in the 
examples . 

The invention also relates to the process for obtaining 
these banks and the banks themselves. 

It can be seen that, generally, the process of the 
invention can be used to obtain banks of DNAs specific to a 
given cell species, or to a given variant of the same species, 
where another species or another variant which is close 
genomically and expresses different pathogenic potencies 
exists . 

Using the process of the invention, DNA banks specific to 
given species of cryptococci, Haemophilus, pneumococci or also 

Escherichia. coli f or more generally any bacterial agent 

belonging to the same species and having different pathovars 
will advantageously be constructed. 

Furthermore , from these banks the invention provides the 
means to have available virulence factors specific to a 
species or a given variant. 

Such banks are therefore tools which are of great 
interest for having available attributes which are responsible 
for the specificity of a pathogen, this use being more 
specifically illustrated according to the invention by the 
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obtaining of banks comprising the attributes responsible for 
the specificity of the meningococcal pathogenesis. 

Study of the products of the invention, the nucleic 
acids, polypeptides and antibodies, has enabled an absolute 
5 specificity with respect ot Neisseria meningitidis, regardless 
of the strain and its variability, to be demonstrated. 

These products are therefore particularly suitable for 
diagnosis or prevention of infections and meningitis caused by 
Neisseria, meningitidis , whether in adults or children and 

10 regardless of the serogroups of the strain in question. 

The method for diagnosis, according to the invention, of 
a meningococcal infection, and more particularly of 
meningococcal meningitis, by demonstration of the presence of 
Neisseria meningitis in an analytical sample is characterized 

3.5 by the stages of : 

~: - bringing into contact a sample to be analysed, that is 

to say a biological sample or a cell culture, and a reagent 
i formulated from at least one nucleic acid as defined above, if 
ij appropriate in the form of a nucleotide probe or a primer, or, 
t>0 as a variant, from at least one antibody or a fragment of an 
1 antibody as defined above, under conditions which allow, 
i respectively, hybridization or a reaction of the antigen- 
* antibody type, and 

J - detection of any reaction product formed. 

25 If the reagent is formulated from a nucleic acid, this 

can be in the form of a nucleotide probe in which the nucleic 
acid or a fragment of the latter is labelled in order to 
enable it to be detected. Suitable markers include 
radioactive, fluorescent, enzymatic or luminescent markers. 

3 0 As a variant, the nucleic acid is included in a host 

cell, which is used as the reagent. 

In these various forms, the nucleic acid is used as such 
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or in the form of a composition with inert vehicles. 

If the reagent is compiled from an antibody, or a 
fragment of an antibody, this can be labelled for detection 
purposes. Most generally, a fluorescent, enzymatic, 
5 radioactive or luminescent marker is used. 

The antibody or the antibody fragment used, which is 
labelled if appropriate, can be used as such or in the form of 
a composition with inert vehicles. 

The sample used in the stage of bringing the components 
10 into contact is a biological sample produced by a mammal, such 
as cephalorachidian fluid, urine, blood or saliva. 

The detection stage is carried out under conditions which 
allow the reaction product to be demonstrated when it is 
formed. Conventional means use fluorescence, luminescence, 
_15 colour or radioactive reactions, or also autoriadography [sic] 
3 techniques. It is also possible to quantify the product. 
^ The invention also relates to the labelled products, the 

nucleic acids and antibodies, as new products. 
4* The method defined above can be used for diagnosis of an 

r^O immune reaction specific to a meningococcal infection. 
, The reagent used is thus a polypeptide according to the 

y invention, as coded by the said nucleic acid sequences, 
5,; corresponding to the natural product or a polypeptide which is 
4* modified but has the biological and immunological activity of 
ris the corresponding natural polypeptide. 

It is advantageously a polypeptide exported beyond the 
cytoplasmic membrane of Neisseria meningitidis, more 

particularly the part of such a polypeptide corresponding to 
the conserved region of the DNA. 
30 The invention also relates to kits for carrying out the 

methods defined above. These kits are characterized in that 
they comprise: 
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- at least one reagent as defined above, that is to say 
of the nucleic acid, antibody or polypeptide type, 

products, in particular markers or buffers, which 
enable the intended nucleotide hybridization reaction or 
immunological reaction to be carried out, as well as use 
instructions . 

The specificity of the products of the invention and 
their location on the chromosome of Neisseria, meningitidis 
Z2491, either grouped in a region and able to be interpreted 
as pathogenicity islets, or isolated on the chromosome, impart 
to them a very particular interest for realization of vaccine 
compositions with a universal purpose, that is to say whatever 
the strain and the variability which it expresses. These 
compositions can include in their spectrum other prophylaxes, 
and can be, for example, combined with childhood vaccines. 

The invention thus provides vaccine compositions which 
include in their spectrum ant imeningococcal prophylaxis, 
intended for prevention of any infection which may be caused 
by Neisseria meningitidis, these compositions being 
characterized in that they comprise, in combination with (a) 
physiologically acceptable vehicle (s), an effective amount of 
polypeptides or anti- antibodies or their fragments as defined 
above, these products optionally being conjugated, in order to 
reinforce their immogenicity [sic] . 

Immunogenic molecules which can be used comprise the 
poliovirus protein, the tetanus toxin, or also the protein 
produced by the hypervariable region of a pilin. 

As a variant, the vaccine compositions according to the 
invention are characterized in that they comprise, in 
combination with (a) physiologically acceptable vehicle (s), an 
effective amount : 

- of nucleic acids as defined above, 
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- of transformed host cells as defined above, or 

- of Nm cells, the chromosome of which has been deleted 
by at least one DNA sequence according to the invention 
involved in the pathogenicity of the bacterium. The nucleotide 
material used is advantageously placed under the control of a 
promoter of its expression in vivo and synthesis of the 
corresponding protein. To reinforce the immunogenicity, it is 
also possible to combine this nucleic material with a DNA or 
an RNA which codes for a carrier molecule, such as the 
poliovirus protein, tetanus toxin or a protein produced by the 
hypervariable region of a pilin. 

The vaccine compositions of the inventions can be 
administered parenterally, subcutaneously, intramuscularly or 
also in the form of a spray. 

Other characteristics and advantages of the invention are 
given in the examples which follow for illustration thereof, 
but without limiting its scope. 

In these examples, reference will be made to figures 1 to 
11, which show, respectively, 

- figures 1A, IB, 1C, ID, IE, IF and 1G: analysis of the 
subtract ive bank Tsp5 0 91, 

- figure 2: the distribution of the Nm-specific sequences, in 
contrast to Ng, on the chromosome of the strain Z2491 (left- 
hand part) and of Nm-specific sequences, in contrast to Nl 
(right-hand part) , 

- figure 3A to 3C: the reactivity of the clones of the 3 
regions of the chromosome according to the invention towards a 
panel of strains of the genus Neisseria, 

- figure 4: the position in region 2 of the chromosome of Nm 
of oligonucleotides used as probes, 

- figures 5, 6 and 7: the Southern blots of a panel of strains 
of the genus Neisseria., using parts of region 2 of Nm as 
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probes , 

- figures 8A to 8C: the Southern blots with 3 subtractive 
banks over a panel of 12 strains of Neisseria, and 

- figures 9, 10 and 11: the reactivity of clones of the 3 
subtractive banks with respect to Nm, Nl and Ng. 

In the examples which follow, the following materials and 
methods were used: 

Bacterial strains - To obtain the subtractive banks, strain 

Z2491 of Nm (Achtman et al . , 1991, J". Infect. Dis. 164, 375- 

382), the strains MS11 (Swanson et al . , 1974, Infect. Immun. 

10, 633-644) and the strains 8064 and 9764 of Nl were used, it 
being understood that any other strain of the species in 
question could be used. 

In order to verify the specificity of these banks, 6 
strains of Nm, 4 strains of Ng, one strain of Nl {Neisseria 

lactamica) and one strain of Nc {Neisseria cinerea) were used. 

The six strains of Nm are: Nm Z2491 of serogroup A, Nm 
8013 of serogroup C (XN collection) , Nm 1121, no serogrouping 
possible (XN collection) , Nm 1912 serogroup A (XN collection) , 
Nm 7972 of serogroup A (XN collection) and Nm 8216 of 
serogroup B (XN collection) . 

The four strains of Ng are: Ng MS11 (Pasteur Institute, 
Paris) , Ng 403 (Pasteur Institute, Paris) , Ng 6934 (Pasteur 
Institute, Paris) , Ng WI (isolated from a disseminated 
gonococcal infection), Ng 4C1, Ng 6493 and Ng FA 1090. 

The strains of Nl are Nl 8064 and Nl 9764 (XN 
collection) , and that of Nc is Nc 32165 (XN collection) . 

Molecular genetics techniques 

Unless indicated otherwise, the techniques and reagents 
used correspond to those recommended by Sambrook et al 

(Sambrook et al 1989, Molecular Cloning: A Laboratory Manual. 

Cold Spring Harbor Laboratory Press) . The 
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oligodeoxynucleotid.es used in this study are: 



RBAml2, 3 r AGTGGCTCCTAG 54 (SEQ ID No - 54) 

RBam24, 5' AG CACT CT C CAGC CT CT C AC CGAG 3'; (SEQ IN No . 55) 
Jbaml2, 3' GATCCGTTCATG 5 r ; (SEQ ID No. GO) 
JBAM24 , 5' ACCGACGTCGACTATCCATGAACG 3'; (SEQ ID No. 61) 
REcol2, AGTGGCTCTTAA; (SEQ ID No. 56) 

REco24, 5' AGCACTCTCCAGCCTCTCACCGAG 3 1 ; (= RBam 24) 
JEcol2, GTACTTGCTTAA; (SEQ ID No. 62) 
JEco24, 5' ACCGACGTCGACTATCCATGAACG 3'; (= JBam24) 
NEcol2, AATTCTCCCTCG; (SEQ ID No. 64) 
NEco24, AGGCAACTGTGCTATCCGAGGGAG; (SEQ ID No . 65) . 
Transfer to membranes (Southern blots) 

The transfers to membranes were effected by capillary 
transfers to positively charged nylon membranes (Boehringer 
Mannheim) . The hybridizations were carried out at 65 °C in a 
solution comprising NaPi [sic] 0.5 M pH 7.2/EDTA 1 mM/SDS 
7%/BSA 1%. The membranes were washed in a solution comprising 
NaPi [sic] 40 mM pH 7.2/EDTA 1 mM/SDS 1%. The final washing 
was carried out at 65 °C for 5 min. 

The probe frp obtained with oligonucleotides based on the 

frpA sequence corresponds to 2.4 kb of the 5' end of the gene 

of the strain Z2491. The opc and rotamase probes corresponding 

to whole genes are produced from the strain Z24 91 using 
oligonucleotides constructed on the basis of published 
sequences. The probes pilCl and ppk (polyphosphate kinase) 

correspond to inserts of the plasmids pJLl and pBluePPKGOOl 
respectively. 

Example 1: Construction of banks of DNAs present in Nm and 
absent from Na . 
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a. "Mbol" bank 

Construction - The DNA of Nm Z2491 was cleaved by the 
endonuclease Mbol and subjected to two iterations of a method 
called CDA (comprehensive difference analysis) below. This 
method comprises subtractive hybridization in the presence of 
excess sheared DNA of Ng MS11 and amplification by PCR of 
those meningococcal sequences which, since they are absent 
from or do not have significant homology with the DNA of Ng 
MS11, could reanneal 

The chromosomal DNA of the strain Ng MS11 is sheared 
randomly by repeated passage through a hypodermic syringe 
until fragments of a size ranging from 3 to 10 kb are 
obtained. These DNA fragments are purified by extraction with 
phenol . 

The chromosomal DNA of the strain Nm Z24 91 is itself 
cleaved by the restriction endonuclease Mbol. These DNA 

fragments (20 jag) are spliced with 10 nmol of annealed 
oligonucleotides RBaml2 and RBam24 . The excess primers are 
removed by electrophoresis over 2% agarose gel of low melting 
point. The part of the gel containing amplified fragments 
greater than 200 bp in size is excised and digested by p- 
agarase. These fragments are purified by extraction with 
phenol . 

To carry out a subtractive hybridization (first 
iteration), 0.2 \xg of the Nm DNA spliced with the RBam 
oligonucleotides is mixed with 4 0 |ixg Ng DNA in a total volume 
of 8 ml of a buffer EE 3X (a buffer EE IX is composed of N- (2- 
hydroxyethyl) piperazine-N ' - (3 -propanesulphonic acid) 10 mM and 
EDTA 1 mM, and its pH is 8.0). This solution is covered with 
mineral oil and the DNA is denatured by heating at 100 °C for 2 
min. 2 jlxI NaCl 5 M are added and the mixture is left to 
hybridize at 55 °C for 48 h. The reaction mixture is diluted to 
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1/10 in a preheated solution composed of NaCl and buffer EE, 
and in then immediately placed on ice. 

10 ill of this dilution are added to 400 jul of PCR 
reaction mixture (Tris.HCl pH 9.0 10 mM; KC1 50 mM; MgCl 2 1.5 
mM; Triton X100 0.1%; 0.25 mM of each of the four triphosphate 
deoxynucleotides; Taq polymerase 50 units per ml) . The mixture 
is incubated for 3 min at 70 °C to complete the ends of the 
reannealed meningococcal DNA fragments. 

After denaturing at 94 °C for 5 min and addition of the 
oligonucleotide RBam24 in an amount of 0.1 nmol per 100 \xl , 
the hybridizations are amplified by PCR (30 cycles of 1 min at 
94 °C, 1 min at 70 °C and 3 min at 72 °C, followed by 1 min at 
94°C and 10 min at 72 °C; Perkin-Elmer GeneAmp 9600) . 

The amplified meningococcal fragments are separated from 
the primers and high molecular weight gonococcal DNAs on gel. 
They are digested by Mbol and the oligonucleotides JBaml2 and 

JBam 24 are spliced to them again. These spliced DNAs are 
again purified over gel and extracted with phenol. 

A second iteration of the subtractive hybridization is 
carried out on 4 0 jig of the randomly sheared Ng DNA and 25 ng 
of the DNA spliced with the JBam oligonucleotides obtained 
from the first iteration of the subtractive hybridization. 
During this second iteration, amplification of the auto- 
annealed Nm DNA is effected with the aid of the 
oligonucleotide JBam24 . 

Specificity - In order to confirm their Nm specificity, 

the amplified sequences after the second iteration of the CDA 
method are labelled and used as a probe for the DNA digested 
by Clcil produced from a panel of six strains of Neisseria. 

meningitidis , four of Neisseria, gonorrhoeae, one of Neisseria 

lactamica and one of Neisseria cinerea. 

The Southern blots obtained show that the amplified 
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sequences resulting from the second iteration of the CDA 
method have a high reactivity with several bands corresponding 
to meningococci, and do not have a reactivity with the bands 
corresponding to the Ng, Nl and Nc strains. 
5 The "Mbol" bank thus appears to be Nm-specific. 

Exhaustivity - In order to test the exhaustivity of the 

bank, all the products produced from the first and second 
iterations of the CDA method and also the initial chromosomal 
materials of Nm Z2481 [sic] and Ng MS11 are subjected to 
10 agarose gel electrophoresis, transferred to a membrane and 
brought into contact with probes comprising genes known to be 
meningococcus- specif ic, that is to say frp, opc and rotamase 

(Southern blotting) . 

As a result of these hybridizations, the Nm-specific gene 
H3- 5 frp is represented in the Mbol bank by a fragment of 600 bp, 

^ but no activity is observed for the rotamase and opc genes. 

The Mbol bank, although Nm-specific, therefore cannot be 

J:: considered exhaustive. 

^; Given their high specificity, the fragments produced by 

, SO the second iteration of the CDA method for the Mbol bank can 

h* nevertheless be cloned on the BanzHI site of the plasmid 

ul 

pBluescript . 

7*1 A sequence corresponding to any of the Nm-specific genes 

Lh can be included in the subtractive bank only if it is carried 
25 by a restriction fragment of appropriate size. This condition 
is a function of two factors. Firstly, the probability that 
the largest fragments are entirely Nm-specific is low. 
Secondly, even if such fragments existed, they would be under- 
represented in the bank because of the limitations of the PCR 
3 0 technique, the amplification effectiveness of which decreases 
with increasing size of the fragments. Fragments greater that 
about 60 0 bp in size are not included in the bank. Because of 
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the absence of Mbo fragments of suitable size from the 

chromosome of Nm Z24 91, the rotamase and opc genes cannot be 

included in the bank. Any enzyme cannot by itself produce a 
small fragment corresponding to any Nm-specific gene. A second 
bank was therefore constructed using another restriction 
enzyme with a different specificity: Tsp509 [sic] . 

Jb. "Tsp5091" bank 

Construction - The enzyme TspSOSl has the advantage of 
producing fragments of smaller size (less than about 1 kb) 
than the enzyme Mbo J. 

Tsp509X recognizes the sequence AATT and leaves, 

projecting at 5' y a sequence of 4 bases compatible with EcoRX . 

The oligonucleotides used are Reco, Jeco and NEco. 

The method followed conforms with that followed for 
construction of the "Mbol" bank described above. However, 

higher quantities of meningococcal DNA were used for the first 
iteration of the subtractive hybridization in order to 
compensate for the higher number of fragments of low molecular 
weight produced by Tsp5Q9I. For the first iteration, 400 ng Nm 

DNA fragments and, in the second, 2 5 ng Nm fragments are 
subjected to subtractive hybridization with 4 0 p,g randomly 
sheared Ng DNA. 

For the construction of this H rsp5 09I rf bank, as a 
control, a third iteration of the subtractive hybridization is 
carried out using 4 0 p,g sheared Ng DNA and 0.2 ng Nm fragments 
resulting from a digestion by Tsp509I and a resplicing, with 

NEco adaptors, of the fragments obtained as a result of the 
second iteration . 

Specificity - As described for the previous bank, the 

product resulting from the second iteration of the CDA method 
is labelled and used as the probe for a panel of strains of 
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Neisseria. 

Figure 1A illustrates the Southern blot hybridization of 
products of the second iteration of the CDA method with the 
DNA digested by Clal of: Nm in track a, Ng MS11 in track b, Nm 

5 8013 in track c, Ng 403 in track d, Nm 1121 in track e, Ng 
6934 in track f, Nm 1912 in track g, Ng WI (strain DGI) in 
track h, Nm 7972 in track i, Nl 8064 in track j, Nc 32165 in 
track k, Nm 8216 in track 1. 

In contrast to the high reactivity observed with all the 
10 Nm strains, a low or no reactivity is observed with the Ng, Nl 
and Nc strains . 

The specificity of the bank was studied earlier by 
reacting membrane transfers (Southern blots) of the products 
produced by each of the three iterations of the CDA method 
■^§.5 with probes corresponding to pilCl and ppk. These two genes 

1~ are common to Nm and Ng. 

;1j Figure IB shows an agarose gel after electrophoresis of 

™! the chromosomes of Nm Z2491 and Ng Msll, digested by Tsp509 

in [sic] , and products resulting from each of the iterations of 
; 2 0 the CDA method. 

Q In track a 1 jxg of the chromosome of Nm was deposited, in 

^ track b 1 jag of that of Ng, in track c 0.15 jig of the products 
21 resulting from the first CDA iteration, in track d 0.1 jag of 
those of the second iteration, in track e 0.05 jug of the third 
25 iteration, MW representing the molecular size markers. 

Figures 1C and ID show gels obtained as described in 
figure IB after transfer to the membrane (Southern blots) and 
hybridization with pilCl (figure 1C) and ppk (figure ID) . 

At the end of the second iteration of the CDA method, the 
3 0 sequences corresponding to the pilCl and ppk genes are 

completely excluded from the bank. 

Exhaustivity - The exhaustivity of the bank was examined 
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by reacting the products resulting from the subtractive 
hybridization with the probes corresponding to three Nm- 
specific genes (frp, rotamase and opc) . 

These Nm-specific probes react with the amplification 
products resulting from the first and second iteration of the 
subtractive hybridization. 

Figures IE, IF and 1G show gels obtained as described in 
figure IB after transfer to the membrane (Southern blots) and 
hybridization with frpA (figure IE) , rotamase (figure IF) and 
opc (figure 1G) . 

However, a third iteration of the subtractive 
hybridization leads to the loss of Nm-specific sequences, 
since the fragments which react with the rotamase and opc 
genes are absent from this third iteration. 

In consideration of all these data, it emerges that the 
products resulting from the second iteration of the CDA method 
are Nm-specific and also constitute an exhaustive bank of Nm- 
specific sequences . 

The products resulting from this second iteration are 
cloned at the EcoRI site of the plasmid pBluescript. 

The bank produced by Tsp509I is more exhautive [sic] than 

the bank produced by Mbol, as the theory considerations based 

on the enzymatic production of smaller restriction fragments 
would suggest . 

In accordance with this aspect, it should be noted that 
the Tsp509I bank is less redundant than the Mbol bank, that is 
to say it comprises less duplication of clones. 86% of the 
clones of the Tsp509I bank* correspond to distinct sequences, 

while only 43% of the clones correspond to distinct sequences 
in the Mbol bank (data not shown) . 

The bank produced by Tsp50 9I thus constitutes a source of 

Nm-specific clones . 
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Example 2: Analysis of the clones of the subtractive bank 

Cloning and sequencing of the Nm-specific DNAs 

The DNAs of the subtractive banks are clones at the BairiHI 
5 (Mbol bank) or £*coRI (Tsp5Q9I bank) site of the plasmid 
pBluescript, and then transformed in DH5a of E. coli. The 

inserts are amplified by PCR carried out on the transformed 
colonies using the primers M13-50 and M13-40, the latter 
primer being biotinylated on its 5 T end. 
10 Sequencing was carried out on each PCR product after 

separation of the biotinylated and non-biotinylated strands 
using the system of Dynabeads M-280 with streptavidin (Dynal, 
Oslo) . The sequences are screened according to their 
homologies with previously published sequences using the 
yPL5 computer programs Blastn and Blastx (NCBI, USA and Fasta) . 

The PCR products resulting from the transformed bacteria 
V| colonies after using the primers M13-40 and M13-50 as 
t« described above were labelled by incorporation with random 
4 priming of a- 32 P-dCTP and were used as a probe for the membrane 
transfers of the chromosomal DNA digested by Clal of strains 

if Nm Z24 91 and Ng MS11, as described above, in order to verify 
1; their specificity. 

Mapping of clones on the chromosome of the strain Nm 
25 Z2491. 

The results of studies carried out with 17 clones of the 
"Mbol" bank (designated by the letter B) and 16 clones of the 

"Tsp5091" bank (designated by the letter E) , each of these 
clones having a unique sequence and being without counterpart 
3 0 in Ng, are reported. 

The positions of the DNA sequences corresponding to 
cloned Nm-specific products were determined with respect to 
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the published map of the chromosome of Nm Z24 91 (Dempsey et 

al. 1995, J. Bacteriol. 177 , 6390-6400) and with the aid of 

transfers to membranes (Southern blots) of agarose gel 
subjected to pulsed field electrophoresis (PFGE) . 

The Nm- specif ic clones are used as probes for a 
hybridization on membranes (Southern blots) of the DNA of Nm 
Z2491 digested with enzymes of rare cutting sites, that is to 
say Pad, Pmel, Sgfl, Bglll , Spel Nhel and Sgfl. 

The gels (20 x 20 cm) were gels of 1% agarose in a buffer 
TBE 0.5X and were subjected to electrophoresis at 6 V/cm for 
3 6 hours according to pulsation periods varying linearly 
between 5 and 3 5 seconds. 

The hybridizations on the membrane (Southern blots) were 
carried out as described above. 

The results obtained are shown on figure 2: the 
reactivity was located by comparison with the positions of the 
fragments of corresponding size on the published map. The 
positions of all the genetic markers mapped by Dempsey et al 
(mentioned above) are visualized with the aid of points on the 
linear chromosomal map. The Nm-specific genes disclosed 
previously are labelled with an asterisk. The two loci called 
"frp" correspond to the frpA and frpC genes. The "pilC" loci 
correspond to the pilCl and pilC2 genes, which are pairs of 

homologous genes and are not distinguished on the map. The 
accuracy of the positions of the Nm-specific clones of the 
invention depends on the overlapping of reactive restriction 
fragments. On average, the position is +/- 20 kb. 

This mapping reveals a non-random distribution of the Nm- 
specific sequences. The majority of the Nm-specific sequences 
belong to three distinct groups. One of these groups (region 
1) corresponds to the position of genes relating to the 
capsule which have been described previously. 
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A distinction is made between: 

- E109, E138, B230 and B323 as being region 1, 

- B322, B220, B108, B132, B233, B328, E139, E145 as B101 
as being region 2, and 

5 - B306, E114, E115, E124, E146, E120, E107, E137 and 142 

as being region 3 . 

63% of the sequences identified as specific to meningococci 
are located inside these three distinct regions. 

This grouping contrasts with the distribution of 
10 previously disclosed Nm-specific genes (frpA, frpC, poxA, opc 

and the region relating to the capsule) . 

This prior art would suggest in fact that the Nm-specific 

genes, with the exception of functional genes relating to the 

capsule, were dispersed along the chromosome. 
^15 Mapping of Nm-specific sequences on the chromosome leads 

iii to an unexpected result with regard to the prior art. 
JJ:! The majority of the genetic differences between the 

93 meningococcal and gonococcal strains tested are grouped in 

three distinct regions. 

is 

sj!0 Meningococcal genes relating to the capsule are grouped 

- in region 1 . 

The function of genes of the other regions is unknown, 
''r* but homologies with published sequences (table 1) suggest 

similarities between certain genes of region 3 and 
f£S bacteriophage transposase and regulatory proteins. No 

meningococcal ' virus has been characterized and it is tempting 

to think that these sequences are of phagic origin. 

Interestingly, the genome of H. influenzae also contains a 

sequence homologous to that of the Ner regulatory protein of 
30 phage Mu, but it is not known if it is a functional gene. 

The clone B208 has a high homology (48% identical, 91% 
homology for 33 amino acids) with a clone of conserved regions 
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field III) in the class of proteins which bind to TonB- 
dependent ferric siderophors. 

The proximity of this clone with the Nm- specific porA 

genes and the frp genes regulated by iron, and in particular 

5 the possibility that it is an Nm- specif ic receptor protein 
exposed on the external membrane in itself is a good candidate 
for further research. 

The clone B339 corresponds to the Nm-specific insertion 
sequence 1S1106. 

10 The low homology between the clone B134 and the Aeromona.s 

insertion sequence and also the presence of multiple copies of 
the clone B134 among the various strains of Nm suggest that it 
could be a new type of Nm-specific insertion sequence. 

The possibility that the regions containing the Nm- 
;5.5 specific clones could correspond to pathogenicity islets as 
41 described previously for E. coli and Y. pestis is of 

fv[ particular interest. 

;F; The clones isolated in this invention will allow better 

Zl understanding of the relevance of Nm-specific regions in 
%20 allowing cloning and sequencing of larger chromosomal 
!pj fragments, and directly by their use for loci mutations. 
\^ Finally, detection of meningococcus -specific genes 

;!: possibly involved in the pathogenicity of the organism allows 
Z= : targeting of suitable antigens which can be used in an 
25 antimeningococcal vaccine. 

The effectiveness and the speed of the method according 
to the inventions enables it to be used in a large number of 
situations for which the genetic differences responsible for a 
phenotype peculiar to one of 2 close pathogens are 
3 0 investigated. 
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Study of the reactivity of the clones of regions 1, 2 and 3 

towards a panel of strains of Neisseria. . 

The PCR products corresponding to inserts of each of the 
clones were collected and used as probes for hybridization on 
5 membranes (Southern blots) for a panel of strains of Nm, Ng, 
Nl and Nc. 

Regions 1 and 2 produce a limited number of bands for 
each of the meningococci. This suggests that these regions are 
both Nm-specific and common to all the meningococci. 

0 Figure 3 illustrates the reactivity of the clones of 

regions 1, 2 and 3 towards a panel of neisserial strains. The 
clones of regions 1 (figure 3A) , 2 (figure 3B) and 3 (figure 
3C) taken together were used as probes towards a panel of 
meningococci, gonococci and towards a strain of Nl and Nc . 

5 The tracks are as follows: DNA of: Nm Z2491 in track a, 

of Ng MS11 in track b, of Nm 8013 in track c, of Ng 403 in 
track d, of Nm 1121 in track e, of Ng 6934 in track f, of Nm 
1912 in track g, of Ng WI (strain DGI) in track h, of Nm 7972 
in track i, of Nl 8064 in track j, of Nc 32165 in track k, and 

0 of Nm 8216 in track 1. 

Remarkably, region 3 has reactivity only with the 
meningococci of serogroup A. This region 3 is therefore 
specific to a sub-group of Nm. 

A comparison was made with the known sequences in the 

5 databanks in order to evaluate the possible functions of the 
cloned regions . 

Table 1 which follows gives the positions of specific 
clones on the chromosomal map and the homologies with known 
sequences . 
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Firstly, it can be seen that the clones of region 1 all 
correspond to genes involved in biosynthesis of the capsule. 
These genes have previously been studied among the Nm of 
serogroup B (Frosch et al . 1989, Proc . Natl. Acad. Sci. USA 
86, 1669-1673 and Frosch and Muller 1993, Mol . Microbiol. 8. 
483-493) . 

With the exception of a low homology with the haemolysin 
activator of Serratia marcescens, the clones of region 2 have 
no significant homology with published sequences, either in 
the DNA or the proteins . 

Two of the clones of region 3 have interesting homologies 
with proteins which bind to the DNA, in particular the 
bacteriophage regulatory proteins and transposase proteins. 

Clone B2 0 8 has a high homology with one of the conserved 
regions in one class of receptors (TonB- dependent ferric 
siderophor) . 

Clones B134 and B339 hybridize with several regions of 
the chromosome (at least 5 and at least 8 respectively) . 

Data relating to the sequences show that clone B3 3 9 
corresponds to the Nm-specific insertion sequence S1106. 

The translation of the clone B143 has a limited homology 
with the transposase of an Aeromonas insertion sequence (SAS2) 
(Gustafson et al . 1994, J. Mol. Biol. 237 , 452-463). We were 
able to demonstrate by transfer on a membrane (Southern blots) 
that this clone is an Nm-specific entity present in multiple 
copies in the chromosomes of every meningococcus of the panel 
tested . 

The other clones have no significant homology with the 
published neisserial sequences, and furthermore nor with any 
published sequence. These clones therefore constitute, with 
the majority of the other clones isolated, a bank of totally 
new Nm-specific loci. 
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Example 3 : Study of region 2 of the Nm chromosome 

. Determination and characterization of the sequence of 
region 2 

PCR amplification is carried out with the chromosomal DNA 
of strain Z2491 of serogroup A, sub-group IV- 1 using 
oligonucleotide primers formulated from each of the sequences 
of clones of region 2 in several different combinations. The 
PCR products which overlap are sequenced from the 2 strands 
using the chain termination technique and automatic sequencing 
(ABI 373 or 377) . 

To prolong the sequence beyond the limits of the clones 
available, partial SauIIIA fragments of 15 kb of the strain 
Z24 91 are cloned in Lambda DASH- II (Stratagene) . 

The phages containing the inserts overlapping region 2 
are identified by hybridization with clones of this region as 
probes. The DNA inserted is sequenced from the ends of the 
inserts, and these sequences are used to formulate new primers 
which will serve to amplify the chromosomal DNA directly, and 
not the phagic DNA. 

An amplification of the chromosomal DNA is obtained using 
these new primers and those of the sequence previously 
available . 

These PCR products are also sequenced from the 2 strands, 
which leads to a complete sequence of 15,620 bp (SEQ ID No. 
36) . The reading frames of this sequence which start with ATG 
or GTG and are characterized by a high codon usage index are 
analysed. 

This analysis reveals 7 ORFs of this type which fill the 
major part of the sequence of 15,620 bp. The positions of 
these ORFs are the following: 
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ORF-1: 1330 to 2970 (SEQ ID No. 37); ORF-2 : 3083 to 9025 (SEQ 
ID No. 38); ORF-3: 9044 to 9472 (SEQ ID No. 39); ORF-4 : 10127 
to 12118 (SEQ ID NO. 40); ORF-5: 12118 to 12603 (SEQ ID No. 
41); ORF-6: 12794 to 13063 (SEQ ID No. 43); ORF-7: 13297 to 
14235 (SEQ ID No. 44); and ORF-8: 14241 to 15173 (SEQ ID No. 
45) . 

Orf-4 starts with the codon GTG and overlaps a slightly 
smaller ORF (SEQ ID No. 41) in the same reading frame (9620- 
12118, frame 2) , which starts with the codon ATG. 

ORF-4 codes for a protein which has structural homologies 
with a family of polypeptides comprising pyocins (Pseudomonas 
aeruginosa) , collcins and intimins (Escherichia coli) , which 
are bactericidal toxins (pyocins, collcins) or surface 
proteins involved in adhesion of bacteria to eukaryotic 
proteins. ORF-7 encodes a protein, the sequence of which 
contains a potentially transmembrane region and which has 
structural homologies with periplasmic proteins or proteins 
inserted in the external membrane of bacteria. The structural 
homologies of ORF-4 and ORF-7 have been identified with the 
aid of the PropSearch program. 

Investigation of sequences homologous to other ORFs in 
GenBank with the aid of the BLAST program revealed a homology 
between the N-terminal regions of ORF-2 and filamentous 
haemagglutinin B of Bordetella pertussis (43% similarity, 3 6% 
identical over 3 52 amino acids) and between ORF-1 and the 
protein fhaC of Bordetella pertussis (35% similarity, 27% 
identical over 401 amino acids) . ORF-1 and ORF-2 are 
neighbouring genes in the strain Z249I and filamentous 
haemagglutinin B of Bordetella pertussis and fhaC are 
neighbouring genes in Bordetella pertussis, which reinforces 
the probability that these homologies reflect functional 
homologies . 
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. Confirmation of the specificity of region 2 with 
respect to Nm 

Southern blots are carried out using the DNA probes 
obtained by PCR amplification of various parts of region 2 
using oligonucleotide primers formulated from sequences of 
clones of region 2 . 

The approximate position of these oligonucleotides is 

shown on figure 4. 

These are the oligonucleotides called R2001 (SEQ ID No. 
46) and R2 0 02 (SEQ ID No. 47) in one half of ORF-1, the 
oligonucleotides b332a (SEQ ID No. 48), el39a (SEQ ID No. 49), 
bl32a (SEQ ID No. 50) and b233b (SEQ ID No. 51) in one half of 
ORF-l+the majority of ORF-2, and the oligonucleotides el45a 
(SEQ ID No. 52) and blOla (SEQ ID No. 53) in 1/3 of 0RF-4+0RF- 
5 to 7. 

The three Southerns are carried out under the following 
hybridization conditions : 
16 h at 65°C, 
NaP0 4 0.5 M, pH 7.2 
EDTA-Na 0.001 M 
1% sodium dodecylsulphate . 

For the washing, heating is carried out for 10 min at 
65°C, and NaP0 4 0.5 M, pH 7.2; EDTA-Na 0.001 M, 1% sodium 
dodecylsulphate are used. 

Figures 5, 6 and 7 respectively show the Southern blots 
obtained with each of the abovementioned ORF parts. 

The 14 tracks correspond respectively, in each of the 
Southerns, to 
1: MS11 (Ng) 
2.: 403 (Ng) 
3 : FA1090 (Ng) 
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4: Wl (Ng) 
5: 6493 (Ng) 

6: marker (lambda hindlll) 
7: Z2491 (Nm, gpA) 
8: 7972 (Nm gpA) 
9: 8013 (Nm, gpC) 

10: 1121 (Nm, grouping not possible) 
11: 1912 (Nm, gpB) 
13: 32165 (Nc) 
14: 8064 (Nl) . 

Given that a panel of strains of Neisseria is used in 
these experiments and that each well is charged with a similar 
amount of digested DNA, these 3 Southern blots clearly show 
that the sequences corresponding to region 2 are found in all 
the meningococci tested and that significant homologous 
sequences do not exist in the genome of the Ng of the strains 
tested. 

Example 4 : Identification of regions of the Nm genome absent 
from Nl and common with Ng 

The technique described in example 1 is followed, but the 
chromosomal DNA of one strain of Nm (Z2491) and 2 strains of 
Nl (XN collections) , equal parts of the DNAs of which are 
mixed, is used. 

2 subtractions are performed using the R and J series of 
primers. Three different banks are thus obtained. 

Two banks, called Bam and Eco, are obtained respectively 
by digestion of the chromosomal DNA of Nm Z24 91 by Mbol and 
Tsp5091; a third bank, called Cla, which results from 
digestion of the chromosomal DNA of Nm by Mspl, is obtained 
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using the primer set RMsplO, RMsp24, JMsplO and JMsp24 . All 
the primers used are shown in the following table 2. 

Table 2 

Adapters for differential banks 

Chromosomal DNA digested by Cloning in 
pBluescript by 

Mbol ~> BamHI 

Tsp5 0 9I -» EcoRI 

Mspl Clal 

First subtraction cycle 

RBaml2 : 3 1 AGTGGCTCCTAG 5' (SEQ ID No. 54) 

RBam24 : 5' AGCACTCTCCAGCCTCTCACCGAG 3' (SEQ ID No. 55) 

REcol2 : AGTGGCTCTTAA (SEQ ID No . 56) 

RBam24 : 5 ! AGCACTCTCCAGCCTCTCACCGAG 3 1 (SEQ ID No. 55) 

(REco 24 = RBam 24) 
RMsplO : AGTGGCTGGC (SEQ ID No . 57) 

RMsp24 : 5 f AGCACTCTCCAGCCTCTCACCGAG 3 1 (SEQ ID No. 

58) 

Second subtraction cycle 

Jbaml2 : 3' GTACTTGCCTAG 5' (SEQ ID No . 59) 

JBam24 : 5' ACCGACGTCGACTATCCATGAACG 3 f (SEQ ID No . 60) 

JEcol2 : GTACTTGCTTAA (SEQ ID No. 61) 

JBam24 : 5' ACCGACGTCGACTATCCATGAACG 3 1 (SEQ ID No 60) 

(JEco 24 = JBam 24) 

JMsplO : GTACTTGGGC (SEQ ID No. 62) 

JMsp24 : 5 T ACCGACGTCGACTATCCATGAACC 3' (SEQ ID No . 63) 



REPLACEMENT SHEET (RULE 26) 



42 



After 2 subtractions, the entire product of each 
amplification is labelled and used as a probe* 

The subtract ive banks are checked by Southern blotting 
over a panel of 12 strains of Neisseria (chromosomal DNA cut 

by Clal) . The hybridization conditions are identical to those 

given in example 1. 

These Southern blots are shown on figures 8A to 8C, which 
relate respectively to the MboI/BamHI bank, to the Mspl/Clal 

bank and to the Tsp5091/ EcoRI bank. 

The 12 tracks correspond respectively, to 

1; Nm Z2491 (group A) 

2: Nl 8064 

3 : Nm 8216 (group B) 

4: Nl 9764 

5: Nm 8013 (group C) 

6: Ng MS11 

7: Nm 1912 (group A) 

8: Ng 4C1 

9: Nm 1121 (grouping not possible) 

10: Ng FA1090 

11: Nc 32165 

12: Nm 7972 (group A) 

Examination of the Southern blots shows that the 
sequences contained in each bank are specific to Nm and are 
not found in Nl . Furthermore, the reactivity found with the 
strains of Ng suggests that some of these sequences are 
present in Ng. 

Each of these banks was then cloned in pBluescript at the 
BamHI site for Bam, or the EcoRI sit for Eco, or the Clal site 
for Cla. In order to confirm the specificity of the clones 
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with respect to the Nm genome, restriction of the individual 
clones and radiolabelling thereof were carried out. The clones 
showing reactivity for both Nm and Ng were kept for subsequent 
studies. These clones are shown on figures 9, 10 and 11, which 
give the profiles with respect ot Nm, Nl and Ng of 5 clones of 
the Bam bank (figure 9) , 16 clones of the Eco bank (figure 
10) and 13 clones of the Cla bank (figure 11) . 

These clones were sequenced using universal and reverse 
primers . They are 

- Bam clones 

partial Bll of 140 bp (SEQ ID No. 66), partial B13 estimated 
at 425 bp (SEQ ID No. 67), B26 of 181 bp (SEQ ID No. 68), B33 
of 307 bp (SEQ ID No. 69), B40 of 243 bp (SEQ ID No. 70), 

- Cla clones 

C16 of 280 bp (SEQ ID No. 72) , partial C20 estimated at 365 bp 
(SEQ ID No. 73), partial C24 estimated at 645 bp (SEQ ID No. 
74), partial C29 estimated at 245 bp (SEQ ID No. 75), C34 of 
381 bp (SEQ ID No. 76), C40 of 269 bp (SEQ ID No. 77), C42 of 
203 bp (SEQ ID No. 78), p C43 of 229 bp (SEQ ID No. 79), C45 
of 206 bp (SEQ ID No. 80), C47 of 224 bp (SEQ ID No. 81), C62 
of 212 bp (SEQ ID No. 82), and C130 (5'...) estimated at 900 
bp (SEQ ID No. 83) , and 

- Eco clones 

E2 of 308 bp (SEQ ID No. 84), partial E5 estimated at 170 bp 
(SEQ ID No. 85), partial E22 estimated at 300 bp (SEQ ID No. 
86), E23 of 273 bp (SEQ ID No. 87), E24 of 271 bp (SEQ ID No. 
88), E29 of 268 bp (SEQ ID No. 89), partial E33 estimated at 
275 bp (SEQ ID No. 90), partial E34 estimated at 365 bp (SEQ 
ID No. 91), E45 of 260 bp (SEQ ID No. 92), E59 estimated at 
greater than 380 bp (SEQ ID No. 93), E78 of 308 bp (SEQ ID No. 
94), E85 of 286 bp (SEQ ID No. 95), E87 of 238 bp (SEQ ID No. 
96), partial E94 greater than 320 bp (SEQ ID No. 97), partial 
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E103 greater than 320 bp (SEQ ID No. 98) and E110 of 217 bp 

(SEQ ID No. 99) . 

Mapping of each clone was carried out on the chromosome 
of Nm Z24 91 as described in example 1. The results obtained 
are given on the right-hand part of figure 2. It is found that 
these clones correspond to regions called 4 and 5 . These 
regions are therefore made up of sequences present both in Nm 
and in Ng, but not found in Nl . It is therefore regarded that 
these are sequences which code for virulence factors 
responsible for the initial colonization and penetration of 
the mucosa. Region 4 is located between argF and regF on the 
chromosome of Nm 24 91 [sic] , and region 5 is located between 
the lambda 375 marker and penA. This region probably contains 
sequences which code for an Opa variant and a protein which 
binds transferrin . 

A comparison with the known sequences in the databanks 
has half [sic] that in region 4 only the clone C13 0 has a 
homology, that is to say with Mspl methylase. In region 5, no 
homology with known sequences was found with the clones C8 , 
E2, B40, C45, E23 and E103 . For the other clones, the 
homologies are the following: 

Bll arginine decarboxylase SpeA; C2 9 arginine 
decarboxylase SpeA; C62 oxoglutarate/malate transporter; 
repetitive DNA element; E34 repetitive DNA element; E94 murine 
endopeptidase MepA ; C47 citrate synthase PrpC; E78 citrate 
synthase PrpC 

Example 5; Demonstration of the presence of one — or — more 

strains of Neisseria meningitidis in a bio logical sample 

A biological sample of the cephalorachidian fluid, urine, 
blood or saliva type is taken. 

After filtration and extraction, the DNAs present in this 
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sample are subjected to gel electrophoresis and transferred to 
a membrane by Southern blotting. 

A nucleotide probe constructed by labelling SEQ ID No . 5 
with 32 P is incubated with this transfer membrane. 

After autoradiography, the presence of reactive band(s) 
allows diagnosis of the presence of Neisseria meningitidis in 

the sample. 

Example 6: Vaccine composition in cluding in its spectrum 

antimenincrococcal prophylaxis a n d intended for prevention of 
any form of infection bv Neisseria meningitidis. 

The peptide coded by a sequence including SEQ ID No. 10 
is conjugated with a toxin. 

This conjugated peptide is then added to a composition 
comprising the ant i- Haemophilus and antipneumococcal vaccine, 

or any other childhood vaccine. 

After having been sterilized, the resulting composition 
can be injected parenterally , subcutaneous ly or 

intramuscularly . 

This same composition can also be sprayed on to mucosa 

with the aid of a spray. 
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SEQUENCE LISTING 

CD GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: I.N.S.E.R.M 

(B) STREET: 101, rue de Tolbiac 

(C) CITY: PARIS CEDEX 13 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE (ZIP) : 75654 

(ii) TITLE OF THE INVENTION: DNA, specific proteins and peptides of the 
Neisseria meningitidis species bacteria, methods for obtaining them and 
their biological applications. 

(iii) NUMBER OF SEQUENCES : 99 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
CB) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 (OEB) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 57 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GATCCGCTGC CGGCAGACGA ATATCAAGAC ATCTTCGATT TTATGAAACA GTATGACTTG 



60 



TCTTACCCGT ATGAATATCT GCAGGATTGG ATAGATTACT ATACGTTCAA AACCGATAAG 



12 0 



CTGGTATTTG GTAACGCGAA GCGAGAGTGA GCCGTAAAAC TCTGAGCTCC TGTTTTATAG 



180 



ATTACAACTT TAGGCCGTCT TAAAGCTGAA AGATTTTCGA AAGCTATAAA TTGAAGCCCT 



240 



TCCACAGTAC ATAGATC 



257 



(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
<B) STRAIN: Z2491 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

GATCATGTTC AAATAGATAG GCATGGGAAG CTGCAGCTCT AACGTCCATG AAAATATGTT 60 

GCATAGCTGC AAGCGGAACG CCTTTTCTTT CATCTACATA AT CT ATAGAG TCAAGGCAAC 120 

CGCTATTGAA ATTAGCAGTA TTGCCTATGA TTACATTAGT AATATGCTCA TACCATTTTT 180 
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GGGTGGTCAT CATATTGTGC CCCATTGTTA TCTCCTTATA TTGGTTTTAG AAGGAACTTT 240 
GACAGGAAGA ATAACGGCCT TACCTGTTTG ACGATC 276 
(2) INFORMATION FOR SEQ ID NO : 3: 

<i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

GATCTGGTGG TGTTTGCACA GGTAGGCGCA TACTTGTTCG GGACTGAGTT TGCGGCGGAT 60 

AAGGGTGTCG ATGTGCTGAA TCAGCTGCGA ATCGAGCTTA TAGGGTTGTC GCTTACGCTG 120 

TTTGATAGTC CGGCTTTGCC GCTGGGCTTT TTCGGCGCTG TATTGCTGCC CTTGGGTGCG 180 

GTGCCGTCTG ATTTCGCGGC TGATGGTGCT TTTGTGGCGG TTAAGCTGTT TGGCGATTTC 24 0 

GGTGACGGTG CAGTGGCGGG ACAGGTATTG GATGTGGTAT CGTTCGCCTT GGGTCAGTTG 3 00 

CGTGTAGCTC ATGGCAATCT TTCTTGCAGG AAAGGCCGTA TGCTACCGCA TACTGGCCTT 360 

TTTCTGTTAG GGAAAGTTGC ACTTCAAATG CGAATCCGCC GACCTCTTTC AGTTACAGCA 42 0 

GCTTGATC 428 
(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

GATCCTGCAT TGACATCGGC CTTGGCTGTC AGGGTATTGT GAC CGGTAAA GTCGGCATTA 60 

CCGTTGGCCA ATAAGGATAC ATGACCGTCT GCAGAAACAG CATGAAGGCC GTCTGAAACG 120 

ATATTGCCCT GCAATGCGGT GGTTTCGAGA GCCTTGGCTG CGTTCAGCTT GGTATTGCGA 18 0 

AGCTGAATAT TGCCTTTGGC TGCCTGAATG TGCAGATTAC CCGAGTTGGT ACGCAGATTG 24 0 

GTATTGGTAA CATTCAGCAA GCCTGCCTCC ACACC CATGT CTTTTGAGGC AGTGAGGGTT 3 00 

TTACTGGTGC CGGTAATATG GGCAGCGTTA TCCGATTTCA AATGGATGCT GGCCGGCAGA 360 

CAAATCTTTA TCAACATTCA AATTCAGATC 390 
(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 7 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGIN: 

(A) ORGANISM; Neisseria meningitidis 

(B) STRAIN: 22491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GATCAGATTG GTGAAGACGG TATTACCGTC AATGTTGCAG GCCGTTCGGG ATATACGGCG 60 
AAAATCGACG TGTCTCCGAG TACCGATTTG GCGGTTTATG GCCATATTGA AGTTGTACGG 120 
GGTGCAACGG GGTTGACCCA ATCCAATTCA GAGCCGGGTG GAACCGTCAA TTTGATC 177 
(2) INFORMATION FOR SEQ . ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE : nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

GATCAATGAT GCTACTATTC AAGCGGGCAG TTCCGTGTAC AGCTCCACCA AAGGCGATAC 60 

TGAATTGGGT GAAAATACCC GTATTATTGC TGAAAACGTA AC CGT ATT AT CTAACGGTAG 12 0 

TATTGGCAGT GCTGCTGTAA TTGAGGCTAA AGACACTGCA CACATTGAAT CGGGCAAACC 180 

GCTTTCTTTA GAAAC CTCGA CCGTTGCCTC CAACATCCGT TTGAACAACG GTAACATTAA 240 

AGGCGGAAAG CAGCTTGCTT TACTGGCAGA CGATAACATT ACTGCCAAAA CTACCAATCT 3 00 
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GAATACTCCC GGCAATCTGT ATGTTCATAC AGGTAAAGAT C 341 
(2) INFORMATION FOR SEQ ID NO : 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
GATCCAACTG TTTGATTTTA CTGGCTGCTT CTCCATGCGC GGTATTGACC AAAGCCGCAA 60 
GGATATTCGC TTCCAGATTG TCTTTCAGGC TGCCGCCGTT GACAGCGGTA TTAATCAGTG 20 
CGGCACTGCC CGCATTGGCT AGGTTGACGG TCAGGTTGTT GATC 164 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

GATCAATCAC ACATCTTGTC ATTTTTTCGA TTCCTTCATT TCGGTTTCTA ATGTTTCAAT 60 

TCTTGCGGCC ATTTCCTGAA TGGCTTTAGT CAAAACGGGG ATGAACGCTT CGTATTCGAC 12 0 

GGTGTAGGTA TCGTTTGTTT TATTTACCAT CGGCAATCGA CCATATTCAT CTTCCAGCGC 180 

AGCAATGTCC TGGGCAATAA ACCAATGCCG CAACCGATC 219 
(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 56 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCTTGGGT AAGCCCCCAA CCTGCATAGA AAGGCAGGCC GTAGCAGCTG ACTTTTTTGC 6 0 

CGCGCAACAA GGCTTCAAAA CCGGTCAGCG AAGTCATGGT ATGTATTTCG TCTGCGTATT 12 0 

GGAGACAGGT CAGGATGTCG GCTTGTTCGG CGGTTTGGTC GGCATATCGT GCAGCATCAT 18 0 

CAGGGGAAAT ATGGCCGATG CGGTTACCGC TGACTACATC GGGATGCGGT TTGTAGATGA 240 

TATAGGCATT GGGGTTTCGT TCGCGTACGG TACGGAGCAA ATCCAGATTG CGGTAGATTT 300 

GGGGCGAACC GTAGCGGATA GACGCATCAT CTTCAACCTG GCCGGGAACG AGGATC 3 56 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE : nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GATCCGCTTT CAGTTTCCGT ACCGGTGGCA TCAGTCAAGT CCGTTTTGTG CACCAAACCG 60 

CGTCCATATG AAACATAAAA CAAATCGCTT AAGC CCAAAG GGTTATCGAA CGATAAAGCG 12 0 

ACATTTCCTT GATATTTGCC GGTCGTTTTG CCGCCCGCAT CATCTATACC GATACTGAAC 18 0 

CGTATGGGTT TATTCTGCTG CCATTTGATC 210 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GATCCCGAAA CGCAATTGGT CGAAAGCTAT ATGCTGAACG ATGTGTTGCG GTTTTGGGAC 6 0 

AGCGCAGGTT TGGGCGATGG GAAAGAAGCC GACCGCGCCC ATCGGCAAAA ACTGATTGAT 120 

GTCCTGTCTA AAACCTATAC TCATTCGGAT GGGCAGTGGG GCTGGATAGA TTTGGTGTTC 180 

GTTATCCTTG ACGGCAGCTC CCGCGATTTG GGTACGGCCT ATGATTTGTT GAGGGATGTT 240 

ATCCTTAAAA TGATTGATC 259 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATCAAATGG ATGATTTATA TAGAATTTTC TTTTACGACT GCGTGCCGTT TGAAAAGAAA 6 0 

ATGCACAATC CCGTATCTCA TCGTGCCATA GATTTTTCAA AGACTC CGGA AGC CATATTT 12 0 

CGTTGCAATC TGCAT AC CGA ATTGAAGAAG AAGCGTAAAT TAGCGTTACG TTTAGGCAAG 180 

CTGTCGGACA ATACAGCATG GATATTAAAA CCCCAAGTCA TGAAAAATCT TCTGAAAAAC 240 

CCGTCAACTC AAATTACGGA AAACGATGTC GTGCTCGATG TTAAACAAAA AGGTGTAGAT 3 00 
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ATGCGTATAG GCTTGGATAT TTCATCTATT ACCTTAAAAA AACAAGCCGA TAAAATCATC 360 

TTGTTTTCTG GTGATTCCGA TTTTGTCCCA GCAGCCAAAT TAGCCAGACG GGAAGGTATC 42 0 

GATTTTATTC TTGATC 43 6 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GATCGTTTTA CGTCGCAATC GAGCTTTGTG GTGCGCTCGC CTAAAAGCCA ATCTTCTCTC 60 

AATGGCCTGG GTGC CATTTT GCAGGGCACA GGTTTTGCCC GTGCGCAAGA CGATATTTAT 120 

ACCGTGCAGG AATATATGCA GTCGCGTTCG GCTTTGGATG CGTTGCGTAA GAAAATGCCC 180 

ATTCGCGATT TTTATGAAAA AGAAGGCGAT ATTTTCAGCC GTTTTAATGG TTTTGGCCTG 24 0 

CGTGGCGAGG ATGAGGCGTT TTATCAATAC TACCGTGATA AGGTATCCAT CCATTTTGAC 3 00 

TCTGTCTCAG GCATTTCCAA TTTGAGCGTT ACATCGTTTA ATGCCGGTGA ATCTCAAAAG 360 

ATC 363 
(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GATCTTGCGT CATTTATATC TTCACCGATA TTGCAATTAC CGCCGTTCCA GTTGAAATAA 60 

CAACGACTAA AATTGTAGTT CCTAAAAGAA TCATTCCTAT TCTTGCGTAC CATTTCCCAA 12 0 

TAATTGCGCC CGACAATTTC CATTTAATGC TCCATCAGTT CTTTTACTTC CGGAAATCTG 18 0 

CTGTAATCTG ACATAAGACG CATAATTGAA CTATCAACGC CGTAACAGCC ATAGGTTTTA 240 

ATACCGTTTT CGGCGTGTTC CCAAATGCAA TTACTGTATT CGTAGCCTTT TACAAATTTA 3 00 

TCGGTTTCGG GATC 314 
(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: s.ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GATCATACGA ATCTACCCTA AAATACCCCG TCGCCGATTT AGGATTGGCT ACATAAAGCT 60 
CATTATAAGG GTATTTTGAT GACATGATAC GGTTAAATTC ATTGCCGTTG TTTATCCTGA 120 
TTCTATAAAT TGGTTCAACA GCAAAGCCTC TGGATTCCCT TAATTGATTA TAATATTGCC 18 0 
TGTATGTTTG TACATCATGT CTTGTCCACG GCTCTCCAGG AGTCCTCAGA ATAGCAATCC 240 
CGTTAAATTT CGGATC 256 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN; Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GATCCACGCC TGTGCCTACC TTGGCTTTTT GTTCGCCAAA CAAGGCATTT AAGGTTGAGG 60 

ACTTGCCGAC ACCTGTCGCA CCGACAAGCA AGACATCCAA ATGACGGAAA CCGGCTGCTG 12 0 

TGACTTTTTG CCCGATTTCA GAAATACGGT AACGATGCAT ATGCGCTCCT ACCAGCCAAA 180 

AAAAGAAGCA ACCGTGCTAA TCGCCCCTCC AATCGCTTTT GCAGCACCGC CGATC 23 5 



REPLACEMENT SHEET (RULE 26) 



58 



(2) INFORMATION FOR SEQ ID NO : 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GATCCAACGG GCATCGCTGT CCTTACTCGG TGTGGTTTGA CCGCTGATTT GTCCTTCTTC 60 

GTCAACTTCT ATGGCCTGAC GCTGTTTGCT GCCGGCGGTC TGGATAATGG TGGCATCAAC 120 

GACGGCGGCG GATGCTTTCT CTATTTTTAG GCCTTTTTCG GTCAGTTGGC AGTTAATCAG 180 

TTTGAGTAAT TCGGACAGGG TGTCGTCTTG CGCCAGCCAG TTGCGGTAGC GGCATAAGGT 240 

ACTGTAATCG GGGATGATC 259 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 01 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GATCTGTGCC GTTGATTTTA TCTTTCAGAT GCAGCATCGA ATATCGGAAA GCCAAATCAG 60 
CAATTCTTTT TGCATCGTGT GGATTTTGAG ACGGGCCTAA TGACCGTACC CGCTTAATAA 120 
AAAATGCACC GTCAATCAAA ATGGCGGTTT TCATATTGCT TCCCCTATAT TTGTCAAAGA 180 
TATAAAAAAG CCCTTGGGAT C 201 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleotide 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
<B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AATTCAAAGG AGGCATTTGT TGCAAGAAAA GTACAAAGTG ATTTGCAAAA AGCATTGAAT 60 

GCTAGCAACT ATAACAAGCA GCAATATGCA AGACGTGCGG CAACAGCGTT AGAGAATGCT 120 

TCAAAATCAA AAGTTATGGC AGCGAATTCT TTTTGATCTA TCTTGTGCGA ACGGGTCAAA 180 

TATTCTTCGT ACATTGAGTT AATCGTACCA ATCGCCCTAA CCACATTTTC ATCAGAAAAT 240 

ATGGAAATAA TAGCATCCCT ATACGCACCT AGTGTAATAT TGTTTCTATT ATTAGTTATA 3 00 
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GCATTATTCG AATACATAAT AGCACCTCCA AATT 334 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AATTCCTGCG CACCTTTGCC GATGGGGAGA TAATCGCCTT TTTGCAGCAT TCTGCCCTGA 6 0 

TGGCCGCCGA AACCGGCTTT CAGGTCGGTA CTTCTCGAAC CCATCACTTC CGGCACATCA 12 0 

AATCCGCCCG CCACGCACAC ATAGCCGTAC ATGCCCTGCA CGGCACGCAC CAGTTTCAAG 180 

GTCTGCCCTT TGCGGGCGGT ATAACGCCAA TACGAATAGA CCGGTTCGCC GTCCAATT 238 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 

AATTGGGCGA GATGCTGCCG GAAACGGATT TAAAACAGAT TGCGGCGGCA GTGTTGAAGA 60 

CGAACGATGA GGCGGCATTG CAGAAGGTGG TGAAAACGGC CAAAGGCAAT GCGCGGAAAC 12 0 

TGTCGAAGCT GCTGCTGATT GTGGACTATT TGTTGCAGGT TAACCCTGAT GTTGATTTGG 180 

ATGATGATGT AATCGAACAC GCGGAAACCT ATTTAATCCA CTAAACCTTT GACAGATAAG 240 
GCAATAATT 24 9 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AATTTATGTA CGGTTTTGCC GTTTGCAGTC AGCCAGTCGG CAAGGCGCAG AAAAAAATCG 60 

CCGACAGGGC CTTGAAGCAG CAGGATATTT TCTGCGCTTT CAAGCAGGTT TTGCAGGTTA 120 

TTTTTGAGGA CGGTCTGTTT CATGTTGCAA TGTGGTTTTG TTTTTTATGT AATAGTTTTA 18 0 

GGTTGAACTT TCAAGCATAC GCCAAGAGAA TT 212 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 7 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

AATTCAGTGC CTGCGTCATA TCACGGCTAC CTTGTGGTTC AGGGTTACTG TATCGCCCGC 60 

GGCATCGACG GCTTCAATAT GCAGCTTCAG CCAGCCGTGC TGCGGGGCGG ATGCGGTTAC 120 

TTGGATGGAT TGGGCGCGTT TGGACTGAAT CACGGGCTGC AAGGCTTGCT CGGCGTACTG 180 

TTTGGCCAGT ACTTCGATGC GCTTTAAATG CTTTTGGCGG CGCAATT 22 7 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GATCCAGGAC TCAAAAACCG ATTTCCTAAT AGAGTGTCTA ATATCCCAAT CTTTTTTACC 60 
CCCTCTGCTG TAGAATTGAT AGAGAAAGTT TGTCTATCTT TTTCATATAC CCATGCCTTC 120 
TTTTTATCAT TGTAGCTAAC ATAACCGCCA AACAATGCTT CTAGATC 16 7 

(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AATTCTTGCG GCCATTTCCT GAATGGCTTT AGTCAAAACG GGGATGAACG TTTCGTATTC 60 

GACGGTGTAG GTATCGTTTG TTTTATTTAC CATCGGCAAT CGACCATATT CATCTTCCAG 12 0 

CGCAGCAATG TCCTGGGCAA TAAACCAATG CCGCAACCGA TCTTCTTTAT GACTGCCGTC 18 0 

CTTGATTGGA TTCGCCCACC ATTCGCGGAC TTTGTCCGCT CGTTCATCTG CCGGCAAGTC 24 0 

TTTGAATAAT T 2 51 

(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AATTCCCGAC TATCGCGGAT GCGTAGTTTT TGCCGGTGGG CAAGAGCAGG TGTGGGATAA 60 

GTTAGGTGAT TTGCCCGATG GCGTCAGCCT GACCCCGCCT GAATCGGTAA ATATTGACGG 12 0 

CTTAAAATCC GTAAAACTCG TCGCATTAAA TGCTGCCGCT CAGGCTTTTA TTAACAAGCA 18 0 

CGCCGGTATC GACAGCGTAC CTGAATT 2 07 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 79 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 22491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AATTGTTTGG GAATAATCCA AACAAACAGC ATCAGGATAG CGGCGGCGGT CAGGCTGCCT 6 0 
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GAAAGGATTT TGCCGGGGTT TTTTGTAGGC AAAGCGGACG AGAAACCAAA GCAACAGCAG 12 0 

CATGGTGTCC CAATAGCCGA TTGAGAATAG GATGGCCAAA CCTTCTAGGA AATGGCGTAA 180 

ATCGTTTGTG GTAACCATGG GTAGTTCCTG TGGTTAAATG TGCAGGCTGC TTTTTGCCGA 240 

ACCTTGCCGC ATCTCAAAAG CAGCCTGCGC TTCAGCGTTG CGTTACGCAG TAAAATAATG 3 00 

AATATTTGTA ACGGCTTGGG TATTTTTTGT CAATATTCCC GCCCTTCCCT TAACAGCTGC 360 

CGCGCTTTCC GTTAAAATT 3 79 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 74 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM : Neisseria meningitidis 

(B) STRAIN: Z2491 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

AATTCGCCGA AATCAGGCTG CTGCTCGATA ATCGGCGCGG CCGATTGGCG TTGTGCCTCG 60 

ATTAAATCCA TCTTGTCTTG CAGACGTTTG GCCTGGCCTT TGCGGCGGCG TTCGGCCAGT 12 0 

TGTTCCATCC GCGTTTCCGC AAATGCCGCC CGTTTGTTGC CGTTGAATAC CGCTTTGCAA 18 0 

ATCACCTTGC CCTGCATATC CTTCACAATC ACATGGTCGG CATCGTGGAT GTCGTAAGCC 24 0 

ACCCGTACCT TCTGACCGCT GTAATC CAGC AATT 274 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 29: 

AATTCCGTTC TTATTGGGCT TTTTCCATCC ATCGGGTATG CCTGAAGGGA ACGCAAACCC 60 

TGCCACTTGC CCATCGCTCC ATTCCCGCAT TAGCGCGTCT GACGGCAAGT GTTCTCGCGC 120 

CCAATCAAGC CACGCCTGCC GCATTGCGGC CTTGTCCTGC TGAAAACTTC GCAGTGCTTT 18 0 

TGCAACCGGC CCATCATTAA CTTCAATCAA ATAAATCATT ATATTTGCGT TCATTTTTCC 24 0 

TACACCTTCG CCACATCCAA ATT 2 63 
(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: 22491 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AATTGTTCAA GAAAAAAGTC GGCACGGCGC GGCAACGGGG AAAATGCGTT GACGCCGTCT 60 

TTTTCTAAGG TGATGTAGTA GGGGCGGAAA TAGCCTTCTT CAAACGCCCA GAAACTGGCT 12 0 

TGGTTTTCGT TTGCAATGCG TTTTGCAATG ACGTGATAAG GGCGTGTGTC GCCAAAGCAG 18 0 

ACAACGGCCT GGATGTGATG TTGAGTGATG TATTCTTGCA AAAACTCAGG AAAGGCGTCG 240 

TAGTTGTCGT TAAAAACAAC GGTATGCGCT TGAGTGGGCG GATAAAAATA GTCGTCGCCT 3 00 

GCATTAAAGT TGAATT 316 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleotide 

( C ) S TRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AATTCAATCA ACGGAAAACA CATCAGCATC AAAAACAACG GTGGTAATGC CGACTTAAAA 60 

AACCTTAACG TCCATGCCAA AAGCGGGGCA TTGAACATTC ATTCCGACCG GGCATTGAGC 120 

ATAGAAAATA CCAAGCTGGA GTCTACCCAT AATACGCATC TTAATGCACA ACACGAGCGG 180 
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GTAACGCTCA ACCAAGTAGA TGCCTACGCA CACCGTCATC TAAGCATTAC CGGCAGC C AG 240 

ATTTGGCAAA ACGACAAACT GCCTTCTGCC AACAAGCTGG TGGCTAACGG TGTATTGGCA 3 00 

CTCAATGCGC GCTATTCCCA AATT 324 
(2) INFORMATION FOR SEQ ID NO : 32: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

AATTATGCAA AAAAACGCAA CGCCGAAAAA CTGGCACCGC GCGGATATTG TTGCTGCTTT 60 

GAAAAAGAAA GGCTGGTCAC TTCGAGCACT TTCAATAGAA GCGGGGTTGT CGC CGAATAC 120 

GCTTAGAAGC GCACTGGCCG CCCCTTATCT TAAGGGAGAA AGGATTATTG CCGCTGCAAT 18 0 

CGGAGTGGAA CCGGAAGAGA TTTGGTCCGA ACGGTATGCA GATCGGAATT 23 0 
(2) INFORMATION FOR SEQ ID NO : 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33: 

AATTTAATCG GTGGAATGCC TGTTCAACCG CACCAATCCC GCTGAATACG GTTGCTAATC 6 0 

TAATATGTGA ATCAGGTTTA AGAAAAGTTT TAGATTTCCA ACCTTGTTGA CTGGGAAAGA 12 0 

GCAAAGTTTT TTGTAATCGA GTATCGTGTG TCTGTGCCAT TGTCGAAATA GTCATACTTA 180 

TATCGTTCTG TTTATCTTAT CAATATGAAA ACTACATCGT TGATTGCCCT GACAATGCCT 24 0 

TGGTCAATT 24 9 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: ' 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AATTCTTGTC CCGGAGTCCA ACGTATATTT ACCCTCCTGC GAGCTAAAAG ACTATTATTC 6 0 

TCCACTGCCA CAGTAGC CGC ATTCACCGCC GTATTCACAT CCCCTTTAAC C AATGC C ACT 12 0 
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GCGCTGCCTG CGATAATCTG CGAGTAGGCT ATGACTTTTT GGCGTTCTTG GGGTGACAGT 180 

TTGCCTACAT CGCGTCCGTC CAACAGGGTT TCTCCCACCA TCTCGCCGAC TGCCGCGCCG 240 

ATTGCGCCGT CCCGACATTT GCCTTTATTT GCTACCGCCG ATGCACAGCC TGCTACGGCA 300 

TGGGCTATCT TGTGGGCAAT GTAGTCTTCG CTGAGATTAA ATT 343 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AATTCTTCAA ACATCGTTTC GATAATCGGG TCGGTGTACA CACTGATGCG GTCGCCCGCA 60 

CGGCTTTGAC CGGCTCGGAA AATATAGGCG GTGGCTTTGC CGTCGGCGAT GTCGACGCAC 12 0 

CAACGCCAGA TGGCGTCTTC GGTATTCAAA CAATCACCCG CACAGCTTTC ACCTGCGCGG 18 0 

AATT 184 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TATGCTCAAT CTCATTTTCA AAATGCAAAA CTTTTCTGAT TTTTCCTACT TTTTGCTCAA 60 

TATTAGGAAG GTTTTAGGCA ATTGAAAATT TTTTGGCGCA TTTTTATGCG TCAAATTTCG 12 0 
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TTAACAGACT ATTTTTGCAA AGGTCTCCGT CTGTAAAAGC AAGGATAGGG CATCTGCCCT 180 

TTTGATTGTT TGATTAACGA TACAAGGAGT TTCAAAATGA GAGTTTTATA GTGGATTAAC 24 0 

AAAAACCAGT ACAGCGTTGC CTCGCCTTGC CGTACTATTT GTACTGTCTG CGGCTTCGTC 3 00 

GCCTTGTCCT GATTTAAATT TAATCCACTA TATGTGTTCA TGAAATGACT TGGGTCGGAG 360 

GCTCAGGTAA TGCGCAACAA AGTTCATATT ATTGCGAAAT TTGCGAATCT GCAGGGCTTA 42 0 

ACGATACGGG AAATCCTGAT AAATCTTTAG GATTGC C AAA CAATACGTTC AGTAATCCGC 480 

CTGGTTGGGG AGCTACAATC GGAGCTTTAG CAGGTAGCCG CATAGGTATG CCTGAATTTG 540 

GTACGTTTGG GAGCCATGCC ATTGAAAATT TCGACTGGTC ATGGTATCGA CGTTATAGGG 6 00 

AAATTGCCGA AACGATTGAA CGAGAATATT CAGGCGGTTT GCCTTAATAG TTGAGGAGGT 660 

CATGATGTTT GCCAAACATT ATCAATTCAT CGCACTCGGC ATCATGCTGC TTCTTTATAT 72 0 

GTTGATTCTC TATACGACCG ATTTTTCCAA TCTGACGTAT TGGATGCTGT TTTTTATCTG 78 0 

TTTTATTACA GGAAAAATAT TAGCTCGTTT GTTAGAGAAA AGCTTTAAAT AAAATAGCAG 840 

CTAGTCGCAA AAGGTCGTCT GAAACCTTTT CAGGCGGCCT TTCTAAAATA CATCCAACTT 900 

CCTAATCCCT ATTTTTCAAA AAGGAAATCT ATGCCCCATC TGCAAAACCT GTCTTTGGGC 96 0 

TTAAAGAAAA AGCTGCCTGT TAT C CTGCAA ACAGAAATAT CAGAATGCGG CTTGGCATGT 102 0 

CTGGCGGCTG TGGCGGGATT TCATGGTTTC CATACGAATT TACGCGCACT GCGTTCAAAA 1080 

TACTGTCCGA GACCTTTGCA AAATTCCCCA AAATCCCCTA AATGTCTTGG TGGGAATTTT 114 0 

GGGGAATTTT GCAAAGGTCT CATTCTATAA CTGTAAATAC TTTTAAATTT ATGACAAAAT 1200 

AGTAAATATT GCTAAAATAA TATTGATGTC ATGAAATTTT TTCCTGCTCC ATGTCTGTTG 126 0 
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GTTATCCTGG CTGTCATACC CCTTAAAACC TTAGCTGCCG ATGAAAACGA TGCAGAACTT 1320 

ATCCGTTCCA TGCAGCGTCA GCAGCACATA GATGCTGAAT TGTTAACTGA TGCAAATGTC 13 80 

CGTTTCGAGC AACCATTGGA GAAGAACAAT TATGTCCTGA GTGAAGATGA AACACCGTGT 1440 

ACTCGGGTAA ATTACATTAG TTTAGATGAT AAGACGGCGC GCAAATTTTC TTTTCTTCCT 15 00 

TCTGTGCTCA TGAAAGAAAC AGCTTTTAAA ACTGGGATGT GTTTAGGTTC CAATAATTTG 1560 

AGCAGGCTAC AAAAAGCCGC GCAACAGATA CTGATTGTGC GTGGCTACCT CACTTCCCAA 1620 

GCTATTATCC AACCACAGAA TATGGATTCG GGAATTCTGA AATTACGGGT ATCAGCAGGC 1680 

GAAATAGGGG ATATCCGCTA TGAAGAAAAA CGGGATGGGA AGTCTGCCGA GGGCAGTATT 1740 

AGTGCATTCA ATAACAAATT TCCCTTATAT AGGAACAAAA TTCTCAATCT TCGCGATGTA 18 00 

GAGCAGGGCT TGGAAAAC CT GCGTCGTTTG CCGAGTGTTA AAACAGATAT TCAGATTATA 1860 

CCGTCCGAAG AAGAAGGCAA AAGCGATTTA CAGATCAAAT GGCAGCAGAA TAAACCCATA 1920 

CGGTTCAGTA TCGGTATAGA TGATGCGGGC GGCAAAACGA CCGGCAAATA TCAAGGAAAT 1980 

GTCGCTTTAT CGTTCGATAA CCCTTTGGGC TTAAGCGATT TGTTTTATGT TTCATATGGA 2 040 

CGCGGTTTGG TGCACAAAAC GGACTTGACT GATGCCACCG GTACGGAAAC TGAAAGCGGA 2100 

TCCAGAAGTT ACAGCGTGCA TTATTCGGTG CCCGTAAAAA AATGGCTGTT TTCTTTTAAT 2160 

CACAATGGAC ATCGTTACCA CGAAGCAACC GAAGGCTATT CCGTCAATTA CGATTACAAC 2220 

GGCAAACAAT ATCAGAGCAG CCTGGCCGCC GAGCGCATGC TTTGGCGTAA CAGGTTTCAT 2280 

AAAACTTCAG TCGGAATGAA ATTATGGACA CGCCAAACCT ATAAATACAT CGACGATGCC 2 340 

GAAATCGAAG TGCAACGCCG CCGCTCTGCA GGCTGGGAAG CCGAATTGCG CCACCGTGCT 2400 
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TACCTCAACC GTTGGCAGCT TGACGGCAAG TTGTCTTACA AACGCGGGAC CGGCATGCGC 246 0 

CAAAGTATGC CCGCACCTGA AGAAAACGGC GGCGGTACTA TTCCAGGCAC ATCCCGTATG 2520 

AAAATCATAA CCGCCGGATT GGATGCAGCG GCCCCGTTTA TGTTGGGCAA ACAGCAGTTT 258 0 

TTCTACGCAA CCGCCATTCA AGCTCAATGG AACAAAACGC CTTTGGTTGC CCAAGACAAG 264 0 

TTGTCTATCG GCAGCCGCTA CACCGTTCGC GGATTTGATG GGGAGCAGAG TCTTTTCGGA 2 70 0 

GAGCGAGGTT TCTACTGGCA GAATACTTTA ACTTGGTATT TTCATCCGAA CCATCAGTTC 2 760 

TATCTCGGTG CGGACTATGG CCGCGTATCT GGCGAAAGTG CACAATATGT ATCGGGCAAG 282 0 

CAGCTGATGG GTGCAGTGGT CGGCTTCAGA GGAGGGCATA AAGTAGGCGG TATGTTTGCT 28 80 

TATGATCTGT TTGCCGGCAA GCCGCTTCAT AAACCCAAAG GCTTTCAGAC GACCAACACC 2 94 0 

GTTTACGGCT TCAACTTGAA TTACAGTTTC TAACCTCTGA ATTTTTTTAC TGATATTTAG 3 0 00 

ACGGTCTTTC CTTATCCTCA GACTGTCAAA CTTTACCTAC GTACTTGGCG CGCAGTACGT 3 060 

TCATCTTCAA AATGGAATAG ACATGAATAA AGGTTTACAT CGCATTATCT TTAGTAAAAA 312 0 

GCACAGCACC ATGGTTGCAG TAGCCGAAAC TGCCAACAGC CAGGGCAAAG GTAAACAGGC 318 0 

AGGCAGTTCG GTTTCTGTTT CACTGAAAAC TTCAGGCGAC CTTTGCGGCA AACTCAAAAC 324 0 

CACCCTTAAA ACCTTGGTCT GCTCTTTGGT TTCCCTGAGT ATGGTATTGC CTGCCCATGC 33 00 

CCAAATTACC ACCGACAAAT CAGCACCTAA AAACCAGCAG GTCGTTATCC TTAAAAC CAA 33 60 

CACTGGTGCC CCCTTGGTGA ATATCCAAAC TCCGAATGGA CGCGGATTGA GCCACAACCG 342 0 

CTATACGCAG TTTGATGTTG ACAACAAAGG GGCAGTGTTA AACAACGACC GTAACAATAA 3480 

TCCGTTTCTG GTCAAAGGCA GTGCGCAATT GATTTTGAAC GAGGTACGCG GTACGGCTAG 3 54 0 
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CAAACTCAAC GGCATCGTTA CCGTAGGCGG TCAAAAGGCC GACGTGATTA TTGCCAACCC 3 6 00 

CAACGGCATT ACCGTTAATG GCGGCGGCTT TAAAAATGTC GGTCGGGGCA TCTTAACTAT 3 660 

CGGTGCGCCC CAAATCGGCA AAGACGGTGC ACTGACAGGA TTTGATGTGC GTCAAGGCAC 3 720 

ATTGACCGTA GGAGCAGCAG GTTGGAATGA TAAAGGCGGA GCCGACTACA CCGGGGTACT 3780 

TGCTCGTGCA GTTGCTTTGC AGGGGAAATT ACAGGGTAAA AACCTGGCGG TTTCTACCGG 3 840 

TCCTCAGAAA GTAGATTACG CCAGCGGCGA AATCAGTGCA GGTACGGCAG CGGGTACGAA 3 900 

ACCGACTATT GCCCTTGATA CTGCCGCACT GGGCGGTATG TACGCCGACA G C AT C AC AC T 3 960 

GATTGCCAAT GAAAAAGGCG TAGGCGTCAA AAATGCCGGC ACACTCGAAG CGGCCAAGCA 4 020 

ATTGATTGTG ACTTCGTCAG GCCGCATTGA AAACAGCGGC CGCATCGCCA CCACTGCCGA 4 0 80 

CGGCACCGAA GCTTCACCGA CTTATCTCTC CATCGAAACC ACCGAAAAAG GAGCGGCAGG 414 0 

CACATTTATC TCCAATGGTG GTCGGATCGA GAGCAAAGGC TTATTGGTTA TTGAGACGGG 42 00 

AGAAGATATC AGCTTGCGTA ACGGAGCCGT GGTGCAGAAT AACGGCAGTC GCCCAGCTAC 4260 

CACGGTATTA AATGCTGGTC ATAATTTGGT GATTGAGAGT AAAACTAATG TGAACAATGC 4320 

CAAAGGCTCG GCTAATCTGT CGGCCGGCGG TCGTACTACG ATCAATGATG CTACTATTCA 43 8 0 

AGCGGGCAGT TCCGTGTACA GCTCCACCAA AGGCGATACT GAATTGGGTG AAAATACCCG 4440 

TATTATTGCT GAAAACGTAA CCGTATTATC TAACGGTAGT ATTGGCAGTG CTGCTGTAAT 450 0 

TGAGGCTAAA GACACTGCAC ACATTGAATC GGGCAAACCG CTTTCTTTAG AAACCTCGAC 4560 

CGTTGCCTCC AACATCCGTT TGAACAACGG T AAC AT T AAA GGCGGAAAGC AGCTTGCTTT 46 2 0 

ACTGGCAGAC GATAACATTA CTGCCAAAAC TACCAATCTG AATACTCCCG GCAATCTGTA 468 0 
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TGTTCATACA GGTAAAGATC TGAATTTGAA TGTTGATAAA GATTTGTCTG CCGCCAGCAT 4740 

CCATTTGAAA TCGGATAACG CTGCCCATAT TACCGGCACC AGTAAAACCC TCACTGCCTC 4800 

AAAAGACATG GGTGTGGAGG CAGGCTTGCT GAATGTTACC AATACCAATC TGCGTACCAA 4 860 

CTCGGGTAAT CTGCACATTC AGGCAGCCAA AGGCAATATT CAGCTTCGCA AT AC C AAGCT 4 92 0 

GAACGCAGCC AAGGCTCTCG AAACCACCGC ATTGCAGGGC AATATCGTTT CAGACGGCCT 4 980 

TCATGCTGTT TCTGCAGACG GTCATGTATC CTTATTGGCC AACGGTAATG CCGACTTTAC 5040 

CGGTCACAAT ACC CTGACAG CCAAGGCCGA TGTCAATGCA GGATCGGTTG GTAAAGGCCG 5100 

TCTGAAAGCA G AC AAT AC C A ATATCACTTC ATCTTCAGGA GATATTACGT TGGTTGCCGG 5160 

CAACGGTATT CAGCTTGGTG ACGGAAAACA ACGCAATTCA ATCAACGGAA AACACATCAG 522 0 

CATCAAAAAC AACGGTGGTA ATGCCGACTT AAAAAACCTT AACGTCCATG CCAAAAGCGG 528 0 

GGCATTGAAC ATTCATTCCG ACCGGGCATT GAGCATAGAA AATACCAAGC TGGAGTCTAC 5340 

CCATAATACG CATCTTAATG CACAACACGA GCGGGTAACG CTCAACCAAG TAGATGCCTA 5400 

CGCACACCGT CATCTAAGCA TTACCGGCAG CCAGATTTGG CAAAACGACA AACTGCCTTC 546 0 

TGCCAACAAG CTGGTGGCTA ACGGTGTATT GGCACTCAAT GCGCGCTATT CCCAAATTGC 552 0 

CGACAACACC ACGCTGAGAG CGGGTGCAAT CAACCTTACT GCCGGTACCG CCCTAGTCAA 5 58 0 

GCGCGGCAAC ATCAATTGGA GTACCGTTTC GACCAAGACT TTGGAAGATA ATGC CGAATT 5640 

AAAACCATTG GCCGGACGGC TGAATATTGA AGCAGGTAGC GGCACATTAA CCATCGAACC 57 00 

TGCCAACCGC ATCAGTGCGC ATACCGACCT GAGCATCAAA ACAGGCGGAA AATTGCTGTT 5760 

GTCTGCAAAA GGAGGAAATG CAGGTGCGCC TAGTGCTCAA GTTTCCTCAT TGGAAGCAAA 582 0 
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AGGCAATATC CGTCTGGTTA CAGGAGAAAC AGATTTAAGA GGTTCTAAAA TTACAGCCGG 5 880 

TAAAAACTTG GTTGTCGCCA CCACCAAAGG CAAGTTGAAT ATCGAAGCCG TAAACAACTC 5940 

ATTCAGCAAT TATTTTCCTA CACAAAAAGC GGCTGAACTC AACCAAAAAT CCAAAGAATT 6000 

GGAACAGCAG ATTGCGCAGT TGAAAAAAAG CTCGCCTAAA AGCAAGCTGA TTCCAACCCT 6060 

GCAAGAAGAA CGCGACCGTC TCGCTTTCTA TATTCAAGCC ATCAACAAGG AAGTTAAAGG 6120 

TAAAAAACCC AAAGGCAAAG AATACCTGCA AGCCAAGCTT TCTGCACAAA ATATTGACTT 6180 

GATTTCCGCA CAAGGCATCG AAATCAGCGG TTCCGATATT ACCGCTTCCA AAAAACTGAA 624 0 

CCTTCACGCC GCAGGCGTAT TGCCAAAGGC AGCAGATTCA GAGGCGGCTG CTATTCTGAT 63 00 

TGACGGCATA ACCGACCAAT ATGAAATTGG CAAGCCCACC TACAAGAGTC ACTACGACAA 63 60 

AGCTGCTCTG AACAAGCCTT CACGTTTGAC CGGACGTACG GGGGTAAGTA TTCATGCAGC 642 0 

TGCGGCACTC GATGATGCAC GTATTATTAT CGGTGCATCC GAAATCAAAG CTCCCTCAGG 648 0 

CAGCATAGAC ATCAAAGCCC ATAGTGATAT TGTACTGGAG GCTGGACAAA ACGATGCCTA 654 0 

TACCTTCTTA AAAACCAAAG GTAAAAGCGG CAAAATCATC AGAAAAACCA AGTTTACCAG 660 0 

CACCCGCGAC CACCTGATTA TGCCAGCCCC CGTCGAGCTG ACCGCCAACG GTATCACGCT 6660 

TCAGGCAGGC GGCAACATCG AAGCTAATAC CACCCGCTTC AATGCCCCTG CAGGTAAAGT 6 72 0 

TACCCTGGTT GCGGGTGAAG AGCTGCAACT GCTGGCAGAA GAAGGCATCC ACAAGCACGA 6780 

GTTGGATGTC CAAAAAAGCC GCCGCTTTAT CGGCATCAAG GTAGGTAAGA GCAATTACAG 6840 

TAAAAACGAA CTGAACGAAA CCAAATTGCC TGTCCGCGTC GTCGCCCAAA CTGCAGCCAC 6 900 

CCGTTCAGGC TGGGATACCG TGCTCGAAGG TACCGAATTC AAAACCACGC TGGCCGGTGC 6960 
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CGACATTCAG GCAGGTGTAG GCGAAAAAGC CCGTGTCGAT GCGAAAATTA TCCTCAAAGG 702 0 

CATTGTGAAC CGTATC CAGT CGGAAGAAAA ATTAGAAACC AACTCAACCG TATGGCAGAA 708 0 

ACAGGCCGGA CGCGGCAGCA CTATCGAAAC GCTAAAACTG CCCAGCTTCG AAAGCCCTAC 714 0 

.TCCGCCCAAA TTGTCCGCAC CCGGCGGCTA TATCGTCGAC ATTCCGAAAG GCAATCTGAA 7200 

AACCGAAATC GAAAAGCTGT CCAAACAGCC CGAGTATGCC TATCTGAAAC AGCTCCAAGT 726 0 

AGCGAAAAAC ATCAACTGGA ATCAGGTGCA GCTTGCTTAC GACAGATGGG ACTACAAACA 7320 

GGAGGGCTTA AC CGAAGCAG GTGCGGCGAT TATCGCACTG GCCGTTACCG TGGTCACCTC 738 0 

AGG CGCAGGA ACCGGAGCCG TATTGGGATT AAACGGTGCG GCCGCCGCCG CAACCGATGC 744 0 

AGCATTCGCC TCTTTGGCCA GCCAGGCTTC CGTATCGTTC AT C AAC AAC A AAGGCGATGT 750 0 

CGGCAAAACC CTGAAAGAGC TGGGCAGAAG CAGCACGGTG AAAAATCTGG TGGTTGCCGC 7 56 0 

CGCTACCGCA GGCGTAGCCG ACAAAATCGG CGCTTCGGCA CTGAACAATG TCAGCGATAA 762 0 

GCAGTGGATC AACAAC CTGA CCGTCAACCT AGCCAATGCG GGCAGTGCCG CACTGATTAA 768 0 

TACCGCTGTC AACGGCGGCA GCCTGAAAGA CAATCTGGAA GCGAATATCC TTGCGGCTTT 7 74 0 

GGTCAATACC GCGCATGGAG AAGCAGCCAG TAAAATCAAA CAGTTGGATC AGCACTACAT 78 00 

AGTCCACAAG ATTGCCCATG CCATAGCGGG CTGTGCGGCA GCGGCGGCGA ATAAGGGCAA 78 60 

GTGTCAGGAT GGTGCGATAG GTGCGGCTGT GGGCGAGATA GTCGGGGAGG CTTTGACAAA 7 92 0 

CGGCAAAAAT CCTGACACTT TGACAGCTAA AGAACGCGAA CAGATTTTGG CATACAGCAA 7980 

ACTGGTTGCC GGTACGGTAA GCGGTGTGGT CGGCGGCGAT GTAAATGCGG CGGCGAATGC 8 04 0 

GGCTGAGGTA GCGGTGAAAA ATAATCAGCT TAGCGACAAA GAGGGTAGAG AATTTGATAA 8100 



REPLACEMENT SHEET (RULE 26) 



78 



CGAAATGACT GCATGCGCCA AACAGAATAA 
AAAGTATCAA AATGTTGCTG ATAAAAGACT 
ATCCCGTAGT ACTGAATGTA GAACAATCAG 
TCATTCATCT TGGGAAGCAG GTCTAATTGG 
CAAATCTTAC ACCCAAGCAG ATTTGGCTTT 
ATCTTGGCTT CAATCGGGCA ATACAAAGCC 
TACACTTATT TCAGGAGTTA ATCCTAGATT 
_ AAATACACCT ATTACTAATG TCAAATACCC 

yJ: AAGACATCTG GCAAATGCTG ATGGTTTTAG 

4" CCGCACCAAT TTTATGGCAG AACTAAATTC 

u i 

AACTGATATT GAAGGCATTA CCCGAATTAA 
TAAAC CTGAT GGTGGATTTA AGGAAATTTC 
£7. ATTTTCTGAT GATAAAATAC TTCAAATGGC 

AGCCTCTAAA ATTGCTCAAA ATGAAAGAAC 
TCAATTCTCA GAAACCTTTG ACGGAATCAA 
AAGAATTACA AACATTCACC CAGAATAATT 
TCTAAACTTA AATAAAAAAT CTATAAATAA 
AACAATTTAC CAATTTGAAA CTAAAGATAC 
TACCGGACAT TTTGGAGTAA TAGGTGCTCA 



TCCTCAACTG TGCAGAAAAA ATACTGTAAA 8160 

TGCTGCTTCG ATTGCAATAT GTACGGATAT 8220 

AAAACAACAT TTGATCGATA GTAGAAGCCT 828 0 

TAAAGATGAT GAATGGTATA AATTATTCAG 8340 

ACAGTCTTAT CATTTGAATA CTGCTGCTAA 8400 

TTTATCCGAA TGGATGTCCG ACCAAGGTTA 846 0 

CATTCCAATA CCAAGAGGGT TTGTAAAACA 852 0 

GGAAGGCATC AGTTTCGATA CAAACCTAAA 858 0 

TCAAGAACAG GGCATTAAAG GAGC CCATAA 864 0 

ACGAGGAGGA CGCGTAAAAT CTGAAAC CCA 8 70 0 

ATATGAGATT CCTACACTAG ACAGGACAGG 876 0 

AAGTATAAAA ACTGTTTATA ATCCTAAAAA 882 0 

TCAAAATGCT GCTTCACAAG GATATTCAAA 88 8 0 

TAAATCAATA TCGGAAAGAA AAAATGTCAT 8 94 0 

ATTTAGATCA TATTTTGATG TAAATACAGG 9000 

TAAAGGAAAA ATTATGAAAA ATAATATTTT 906 0 

CAACCATTTT GTTATTTCGA TTTTTTTTGA 912 0 

GCTTTTAGAG TGTTTTAAAA ATATTACAAC 918 0 

ATATGAAAAA ATAGATGCTA CCAGATGGAT 924 0 
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TGGAGATTAT GAAGAGGTAA ATGGATTTGA GTATATTGAT AAAGCTCCTT CTATTTATTT 93 00 

TTCAGTTGGA GATGATTTCA ATC CTGAAGA ATTAATTATA CCTATTAATT TAGCATATCA 9360 

TTACTTTAAT ATTGCAATAT CTGATTTCTT AATAGCTCAC CCTGAATATC AAAAAAAGTG 9420 

TAAAGAAATA CAAAAAACAT ATTCTCAAAC AAACTGTAGC CTGCATGAAA CCTAAAATCC 94 80 

ATGCGTAAGG TGTGTGCTTC AGCACGCACG CGTTCCATGA TTTACGGCTC AATGCCGTCT 954 0 

GAAAAGCTCA CAATTTTTCA GACGGCATTT GTTATGCAAG TAAATATTCA GATTCCCTAT 9600 

ATACTGCCCA GACGCGTGCG^T^CTGAAGAC ACCCCCTACG CTTGCTGCAG AACTTTCGGG 9660 

TAAAACCGGT GTGAGCATTA GCGCACCGTA TGC CAATGAG AACAGTCGCA TCCTGCTCAG 972 0 

CACCACGGAT ATCAGTTCGG AAAACGGCAA AATCAAAATT CAATCTTACG GTGACCAATA 9 780 

TTACTATGCG AGACAGAGCG AACTCTATAC CTTTGAACGC CGCAGCTACA AAACTGGCAA 9840 

ATGGTACAAC CGCAAACACA TTACCGAAGT CAAAGAACAC AAAAACGCCA AGCCCGACGC 9900 

AGTAAAC CTC AGCGCATCCC AAGGCATCGA CATCAAATCT GGTGGCAGCA TCGACGCCTA 9960 

CGCCACCGCA TTCGATGCCC CCAAAGGCAG CATTAACATC GAAGCCGGGC GGAAATTGAC 10 02 0 

ACTCTATGCC GTAGAAGAGC TCAACTACGA CAAACTAGAC AGCCAAAAAA GGCGCAGATT 100 80 

TCTCGGCATC AGCTACAGCA AAGCACACGA CACCACCACC CAAGTC^ATGA AAACCGCGCT 10140 

GCCCTCAAGG GTAGTTGCAG AATCAGCCAA CCTCCAATCG GGCTGGGATA CCAAACTGCA 10200 

AGGCACACAG TTTGAAAC C A CACTGGGTGG CGCAACCATA CGCGCAGGCG TAGGTGAGCA 10260 

GGCACGGGCA GATGCCAAGA TTATCCTCGA AGGGATCAAA AGCAGCATCC ACACAGAAAC 10320 

CGTGAGCAGC AGCAAATCTA CTCTATGGCA AAAACAGGCA GGACGGGGCA GTAACATCGA 10380 
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AACCTTGCAA TTGCCGAGTT TCACCGGTCC CGTTGCGCCC GTACTGTCCG CACCCGGCGG 10440 

TTACATTGTC GACATTCCGA AAGGCAATCT GAAAACC CAA ATCGAAACCC TCACCAAGCA 10500 

GCC CGAGTAT GCTTATTTGA AACAACTTCA AGTTGCGAAA AACATCAACT GGAATCAGGT 10560 

GCAGCTTGCT TACGATAAAT GGGACTACAA ACAGGAGGGC ATGACACCCG CAGCAGCAGC 10620 

TGTCGTCGTT ATCGTCGTAA CCGTATTGAC CTACGGTGCA CTGTCCGCCC CGGCAGCCGC 1068 0 

CGGAACGGCG GGCGCGGCAG GCGCAGGAGC GGGAGGAGCC GCAGCAGGAA CGGCAGCCGG 10 740 

AAGTGGAGTA GCAGCAGGAA CGGCAGCCAC AACCGGAGTA GCAGCAGGCA CATCAGCTGC 108 00 

AGCTATCACC ACAGCCGCAG GCAAAGCCGC ACTGGCCAGT CTCGCCAGCC AAGCCGCAGT 10860 

TTCCCTCATC AACAACAAAG GAGACATAAA CCATACCCTG AAAGAACTGG GCAAAAGCAG 10920 

CACCGTCAGA CAGGCCGCCA CCGCCGCCGT AACCGCAGGC GTACTGCAGG GCATAAGCGG 10980 

GCTGAACACC CAAGCAGCCG AAGC CGTC AG CAAACATTTT CACAGTCCCG CAGCAGGCAA 11040 

ACTGACCGCT AACCTGATCA ACAGCACCGC TGCCGCAAGT GTCCATACCG CCATCAACGG 1110 0 

CGGCAGCCTG AAAGACAACT TGGGCGATGC CGCACTGGGT GCGATAGTCA GTACCGTACA 11160 

CGGAGAAGTA GCGAGCAAAA TCAAATTTAA TCTCAGCGAA GACTACATTG CCCACAAGAT 1122 0 

AGCCCATGCC GTAGCAGGCT GTGCATCGGC GGTAGCAAAT AAAGGCAAAT GTCGGGACGG 1128 0 

CGCAATCGGC GCGGCAGTCG GCGAGATGGT GGGAGAAACC CTGTTGGACG GACGCGATGT 1134 0 

AGG CAAACTG TCACCCCAAG AACGCCAAAA AGTCATAGCC TACTCGCAGA TTATCGCAGG 114 0 0 

CAGCGCAGTG GCATTGGTTA AAGGGGATGT GAATACGGCG GTGAATGCGG CTACTGTGGC 1146 0 

AGTGGAGAAT AATAGTCTTT TAGCTCGCAG GAGGGTAAAT ATACGTTGGA CTCCGCGACA 1152 0 
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AGAATTGGAA CATGAATATG CCATTCTTGA AATCCAGGCC ATTACCAATC AAATCCGAAG 11580 

GCTGGATCCG AAATTTAACG GGATTGCTAT TCTGAGGACT CCTGGAGAGC CGTGGACAAG 11640 

ACATGATGTA CAAACATACA GGCAATATTA TAATCAATTA AGGGAATCCA GAGGCTTTGC 11700 

TGTTGAACCA ATTTATAGAA TCAGGATAAA CAACGGCAAT GAATTTAACC GTATCATGTC 11760 

ATCAAAATAC CCTTATAATG AGCTTTATGT AGCCAATCCT AAATCGGCGA CGGGGTATTT 11820 

TAGGGTAGAT TCGTATGATC CTGCGACAAG GGAAATTATT TCAAGAAAAT TTACCCAATT 118 8 0 

TTCTCAAATC CAAGAAAGTA CGGGGATTGG TTATATCAAG GAGGCTGTTA GAAAATATAG 11940 

CCCTGGTACT GTCATTTCCA ATGTTCCAAG TACACCTACT ACGATAAGAG GAAGAAAGCT 120 00 

TGAAGGAAAA CTTATTTTAG AAGTTCCTGC TCAGGTCAAT CCAATTCCAC AATCTGTATT 12 06 0 

AAGGGCGGCA CAAGAAGAAA ATGTTATCAT TAGAGATACA ACAGGAAGGA TTTACAAATG 1212 0 

AAGAAAGATA TTTTTTATTG TGAGCAGTGG TCTTATGGTT ATAAGAGACT TCATAAGCCT 1218 0 

TTTTCTGAGA AACAAGCTGA GGAAAAACAT CTTAAAGGGG AGTTATATAC TGCCGTAATA 12240 

GGTTCGGCGA CACAACCTGA ATATGTAATT ACCTTGCGAG AGGAAGTAGG TTTTTTTTCG 12 3 00 

GTAAATTTTT TCGATAAATT TGGAAGGGAT TATTTAACCC ATCAATTTCA AAAATATTCC 12360 

AATTCGAATT ATTATTTTCT TTCTATGGCT GTATGGAGAG ATTATATAAC TTTGGAATCT 12420 

CATGACTTAG CAGAAGGATA TACTTATTTC TTCAATGAAA ATACGGATGA TTGCTATGTT 12480 

TTGAAACAAG ATTTTATTAA TAATGAGCGA TATGAAAAAA CAGAATTATA TTCCCAAAAA 12 540 

GATAAGGTAA TTCTATTTCC AAAGTTTGGT GAATATGATT TGGTGTTAAA TCCGGACATT 12 600 

ATTTAATTAA GTTTTAAGGC CGTCTGAAAA AAATTTCAAA CGGCTTTTAT TATTGGGTTT 12 66 0 
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GGAATCTGAG GATAAAGCTG ATAAAAACCA GGAAATTATC AGATTGCTAT ATACGTATTG 12720 

TTGTACAGAC TAAAGGCAGC AATCAAATCA CTATTGCTTA CCCACAAAAA TAAATTGATT 12 780 

ATATGGAATA ATCATGAATA AGAGAATGAA AATGTGTCCT GCTTGTCAAC AAGGCTATCT 1284 0 

CTACCATTCG AAACCTAAAT ATCTTCATGA TGAAATTATT CTGTGTGATG AATGCGATGC 12 900 

AGTATGGCTC AAAGGTATGA ATATATTTTA TGGAGAATAT GAAAAAGATT TTTATTCTTA 12960 

TGTTCCTTTC ATGGAATCCC AAGGTATAAC GAGTGAATGT ATTTGGGAAG GAGATTTGTT 13 02 0 

TGATCATCCA TATTATGAAG ATGAAAACTC AAATGATATG GATTGATGGA AATTTTAAGC 13 080 

CTGCGTAGGT ACGATTAGCC ATCAAACGGC GTAATCATAC GCAAGATTAT CAACAGAGAG 1314 0 

GGCTGGCAGC GATATACCAC CCACAAGATT GCCCATGCCA TAGCGGGCTG TGCGGCAGCG 132 00 

GCGGCGAATA AGGGCAAGTG TCAGGATGGT GCGATAGGCG CTGCAGTCGG TGAGATTGTT 13260 

GGTGAGGCTT TGGTTAAGAA TACTGATTTC AGTCGTATGA GTGCGACCGA AATCGAAAAA 133 20 

GCTAAAGCGA AGATTACTGC CTATTCAAAA CTGGTTGCCG GCACTGCGTC TGCCGTTGTA 1338 0 

GGCGGGGATG TGAATACAGC GGCGAATGCG GCACAGATAG CGGTGGAGAA TAATACTTTG 13440 

TATCCTAGAT GCGTTGGTGC AAAGTGTGAT GAATTTCAAA AGGAACAACA AAAATGGATA 13500 

CGTGAAAATC CTGAAGAATA TCGAGAAGTT TTGCTTTTTC AGACAGGATT TATTCCAATT 13 560 

ATCGGTGATA TACAGAGTTT TGTACAAGCA CAGACCGCTG CCGATCACCT GTTTGCTTTG 13620 

CTGGGTGTGG TTCCGGGTAT CGGTGAATCG ATACAGGCCT ATAAAGTAGC GAAAGCGGCA 1368 0 

AAAAATTTAC AAGGCATGAA AAAAGCCTTG GACAAGGCAG CAACCGTTGC CACTGCACAG 13740 

GGCTATGTCA GCAAAAC CAA AATCAAAATC GGTCAAACTG AATTAAGGGT TACTGCAGCA 13 800 
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ACTGACAAAC AATTGCTGAA AGCTATTGGC GAAGGAAGGG ACACGACAGG TAAAATGACC 1386 0 

GAGCAGTTAT TTGACTCTTT AGCTAAACAA AATGGCTTCA GAGTGCTTTC GGGCGGCAAA 13 92 0 

TACGGCGGAA ATAACGGTTT TGATCATGTA TGGCAGGCTG CCGATGGTAG TGTCGTTTTG 13 98 0 

ATTGTAGAAA GTAAGCAGAT TAGGAACGGT ACGGTACAGC TGAATCCGAA TGGTGCGGGT 14 04 0 

GGATATACGC AAATGAGTGA GGATTGGATT AGACAAGTTT TAGATCAATT ACCCGATGGT 14100 

AGTCCCGCTA AAGCTGCTGT CTTCAAAGCA AATAAGAACG GCACATTAAA AACAGCAATA 14160 

GCAGGCGTTG ATCGTCAAAC AGGTAAGGCC GTTATTCTTC CTGTCAAAGT TCCTTCTAAA 1422 0 

ACCAATATAA GGAGATAACA ATGGGGCACA ATATGATGAC CACCCAAAAA TGGTATGAGC 142 8 0 

ATATTACTAA TGTAAT CATA GGCAATACTG CTAATTTCAA TAGCGGTTGC CTTGACTCTA 14340 

TAGATTATGT AGATGAAAGA AAAGGCGTTC CGCTTGCAGC TATGCAACAT ATTTTCATGG 1440 0 

ACGTTAGAGC TGCAGCTTCC CATGCCTATC TATTTGAACA TGATCTTAAG AAATTCAAGC 14460 

AATATGCTTA TGTTGCAGGA AAGCTGGGGG TTTTGCTGAG TGTAAATTCT ACAGACCGTG 14 520 

AACCCTTCTT CTTTCCCTGT GACATGCTCA ACATTCAAAA TCCGATGTTT CTGATGCTGA 14580 

TGAGCGACAG CCCACAGCTG CGTGAGTTTC TGGTGCGCAA TATCGACAAC ATCGCCAACG 14640 

ATACAGAAGC CTTTATAAAC CGCTACGACC TCAACCGGCA TATGATTTAC AATACTCTGC 14700 

TGATGGTGGA GGGTAAGCAG CTTGATCGGT TGAAACAACG TAGCGAGAAA GTCTTGGCGC 14760 

ATCCCACCCC TAGCAAATGG CTGCAAAAGC GGTTGTACGA TTACCGCTTC TTCCTCGCTT 1482 0 

TCG CCGAACA GGATGC CGAG GCAATGAAAG CCGCCTTAGA GCCGCTTTTC GATAAAAAAA 14880 

CCGCGCGTAT GGCTGCCAAA GAAACATTGT CCTATTTCGA TTTCTACCTG CAGCCGCAAA 14940 
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TCGTTACCTA CGCCAAAATC GCATCCATGC ACGGTTTCGA TTTGGGCATA GATCAAGAAA 1500 0 

TCTCACCGAG GGATTTGATT GTTTACGATC CGCTGCCGGC AGACGAATAT CAAGACATCT 1506 0 

TCGATTTTAT GAAACAGTAT GACTTGTCTT ACCCGTATGA ATATCTGCAG GATTGGATAG 15120 

ATTACTATAC GTTCAAAACC GATAAGCTGG TATTTGGTAA CGCGAAGCGA GAGTGAGCCG 15180 

TAAAACTCTG AGCTCCTGTT TTATAGATTA CAACTTTAGG CCGTCTTAAA GCTGAAAGAT 152 40 

TTTCGAAAGC TATAAATTGA AGCCCTTCCA CAGTACATAG ATCTGTGTTG TGGCGGGGCT 153 0 0 

TTAC CACGCT GATTGCCGGA GAAGAACTCA ACCTGCTGGC AAAACAAGGC ATGAGATCTT 15360 

TGCAATAACA TGAGTTGAGA CCTTTGCAAA AAAGCCCTTC CCCGACATCC GAAACC CAAA 1542 0 

CACAGGATTT CGGCTGTTTT CGTAC CAAAT ACCTCCTAAT TTTACCCAAA TACCCCCTTA 15480 

ATCCTCCTCG GACACCCGAT AATCAGGCAT CCGGGCTGCC TTTTAGGCGG CAGCGGGCGC 15540 

ATTTAGCCTG TTGGCCGCTT TCAACAGGTT CAAACACATC GCCTTCAGGT GGCTTTGCGC 1560 0 

ACTCACTTTG TCATTTCCAA 15620 
(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME / KEY ; Protein 

(B) LOCATION: 1..580 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37: 

Met Lys Phe Phe Pro Ala Pro Cys Leu Leu Val He Leu Ala Val He 
1 5 10 is 

Pro Leu Lys Thr Leu Ala Ala Asp Glu Asn Asp Ala Glu Leu He Arg 
20 25 30 

Ser Met Gin Arg Gin Gin His He Asp Ala Glu Leu Leu Thr Asp Ala 
35 40 45 

Asn Val Arg Phe Glu Gin Pro Leu Glu Lys Asn Asn Tyr Val Leu Ser 
50 55 60 

Glu Asp Glu Thr Pro Cys Thr Arg Val Asn Tyr He Ser Leu Asp Asp 
65 70 75 80 

Lys Thr Ala Arg Lys Phe Ser Phe Leu Pro Ser Val Leu Met Lys Glu 
85 90 95 

Thr Ala Phe Lys Thr Gly Met Cys Leu Gly Ser Asn Asn Leu Ser Arg 
100 105 110 

Leu Gin Lys Ala Ala Gin Gin He Leu He Val Arg Gly Tyr Leu Thr 
115 120 125 

Ser Gin Ala He He Gin Pro Gin Asn Met Asp Ser Gly He Leu Lys 
130 135 140 

Leu Arg Val Ser Ala Gly Glu He Gly Asp He Arg Tyr Glu Glu Lys 
145 150 155 160 

Arg Asp Gly Lys Ser Ala Glu Gly Ser He Ser Ala Phe Asn Asn Lys 
165 170 175 

Phe Pro Leu Tyr Arg Asn Lys He Leu Asn Leu Arg Asp Val Glu Gin 
180 185 190 
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Gly Leu Glu Asn Leu Arg Arg Leu Pro Ser Val Lys Thr Asp lie Gin 
195 200 205 

He He Pro Ser Glu Glu Glu Gly Lys Ser Asp Leu Gin He Lys Trp 
210 215 220 

Gin Gin Asn Lys Pro He Arg Phe Ser He Gly He Asp Asp Ala Gly 
225 230 235 240 

Gly Lys Thr Thr Gly Lys Tyr Gin Gly Asn Val Ala Leu Ser Phe Asp 
245 250 255 

Asn Pro Leu Gly Leu Ser Asp Leu Phe Tyr Val Ser Tyr Gly Arg Gly 
260 265 270 

Leu Val His Lys Thr Asp Leu Thr Asp Ala Thr Gly Thr Glu Thr Glu 
275 280 285 

Ser Gly Ser Arg Ser Tyr Ser Val His Tyr Ser Val Pro Val Lys Lys 
290 295 300 

Trp Leu Phe Ser Phe Asn His Asn Gly His Arg Tyr His Glu Ala Thr 
305 310 315 320 

Glu Gly Tyr Ser Val Asn Tyr Asp Tyr Asn Gly Lys Gin Tyr Gin Ser 
325 330 335 

Ser Leu Ala Ala Glu Arg Met Leu Trp Arg Asn Arg Phe His Lys Thr 
340 345 350 

Ser Val Gly Met Lys Leu Trp Thr Arg Gin Thr Tyr Lys Tyr He Asp 
355 360 365 

Asp Ala Glu He Glu Val Gin Arg Arg Arg Ser Ala Gly Trp Glu Ala 
370 375 380 

Glu Leu Arg His Arg Ala Tyr Leu Asn Arg Trp Gin Leu Asp Gly Lys 
385 390 395 400 
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Leu Ser Tyr Lys Arg Gly Thr Gly Met Arg Gin Ser Met Pro Ala Pro 
405 410 415 

Glu Glu Asn Gly Gly Gly Thr lie Pro Gly Thr Ser Arg Met Lys lie 
420 425 430 

He Thr Ala Gly Leu Asp Ala Ala Ala Pro Phe Met Leu Gly Lys Gin 
435 440 445 

Gin Phe Phe Tyr Ala Thr Ala He Gin Ala Gin Trp Asn Lys Thr Pro 
450 455 460 

Leu Val Ala Gin Asp Lys Leu Ser He Gly Ser Arg Tyr Thr Val Arg 
465 470 475 480 

Gly Phe Asp Gly Glu Gin Ser Leu Phe Gly Glu Arg Gly Phe Tyr Trp 
485 490 495 

Gin Asn Thr Leu Thr Trp Tyr Phe His Pro Asn His Gin Phe Tyr Leu 
500 505 510 

Gly Ala Asp Tyr Gly Arg Val Ser Gly Glu Ser Ala Gin Tyr Val Ser 
515 520 525 

Gly Lys Gin Leu Met Gly Ala Val Val Gly Phe Arg Gly Gly His Lys 
530 535 540 

Val Gly Gly Met Phe Ala Tyr Asp Leu Phe Ala Gly Lys Pro Leu His 
545 550 555 560 

Lys Pro Lys Gly Phe Gin Thr Thr Asn Thr Val Tyr Gly Phe Asn Leu 
565 570 575 

Asn Tyr Ser Phe 
580 

INFORMATION FOR SEQ ID NO : 38: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..1981 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



Met Asn Lys Gly 
1 

Met Val Ala Val 
20 

Ala Gly Ser Ser 
35 

Gly Lys Leu Lys 
50 

Leu Ser Met Val 
65 

Ala Pro Lys Asn 

Pro Leu Val Asn 
100 



Leu His Arg lie 
5 

Ala Glu Thr Ala 

Val Ser Val Ser 
40 

Thr Thr Leu Lys 
55 

Leu Pro Ala His 
70 

Gin Gin Val Val 
85 

lie Gin Thr Pro 



lie Phe Ser Lys 
10 

Asn Ser Gin Gly 
25 

Leu Lys Thr Ser 

Thr Leu Val Cys 
60 

Ala Gin lie Thr 
75 

lie Leu Lys Thr 
90 

Asn Gly Arg Gly 
105 



Lys His Ser Thr 
15 

Lys Gly Lys Gin 
30 

Gly Asp Leu Cys 
45 

Ser Leu Val Ser 

Thr Asp Lys Ser 
80 

Asn Thr Gly Ala 
95 

Leu Ser His Asn 
110 



REPLACEMENT SHEET (RULE 26) 



89 



Arg Tyr Thr Gin Phe Asp Val Asp Asn Lys Gly Ala Val Leu Asn Asn 
115 120 125 

Asp Arg Asn Asn Asn Pro Phe Leu Val Lys Gly Ser Ala Gin Leu He 
130 135 140 

Leu Asn Glu Val Arg Gly Thr Ala Ser Lys Leu Asn Gly He Val Thr 
145 150 155 iso 

Val Gly Gly Gin Lys Ala Asp Val He He Ala Asn Pro Asn Gly He 
1^5 170 175 

Thr Val Asn Gly Gly Gly Phe Lys Asn Val Gly Arg Gly He Leu Thr 
180 185 190 

He Gly Ala Pro Gin He Gly Lys Asp Gly Ala Leu Thr Gly Phe Asp 
195 200 205 

Val Arg Gin Gly Thr Leu Thr Val Gly Ala Ala Gly Trp Asn Asp Lys 
210 215 220 

Gly Gly Ala Asp Tyr Thr Gly Val Leu Ala Arg Ala Val Ala Leu Gin 
225 230 235 240 

Gly Lys Leu Gin Gly Lys Asn Leu Ala Val Ser Thr Gly Pro Gin Lys 
245 250 255 

Val Asp Tyr Ala Ser Gly Glu He Ser Ala Gly Thr Ala Ala Gly Thr 
260 265 270 

Lys Pro Thr He Ala Leu Asp Thr Ala Ala Leu Gly Gly Met Tyr Ala 
275 280 285 

Asp Ser He Thr Leu He Ala Asn Glu Lys Gly Val Gly Val Lys Asn 
290 295 300 

Ala Gly Thr Leu Glu Ala Ala Lys Gin Leu He Val Thr Ser Ser Gly 
305 310 315 320 
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Arg He Glu Asn Ser Gly Arg He Ala Thr Thr Ala Asp Gly Thr Glu 
325 330 335 

Ala Ser Pro Thr Tyr Leu Ser He Glu Thr Thr Glu Lys Gly Ala Ala 
340 345 350 

Gly Thr Phe He Ser Asn Gly Gly Arg He Glu Ser Lys Gly Leu Leu 
355 360 365 

Val He Glu Thr Gly Glu Asp He Ser Leu Arg Asn Gly Ala Val Val 
370 375 380 

Gin Asn Asn Gly Ser Arg Pro Ala Thr Thr Val Leu Asn Ala Gly His 
385 390 395 400 

Asn Leu Val He Glu Ser Lys Thr Asn Val Asn Asn Ala Lys Gly Ser 
405 410 415 

Ala Asn Leu Ser Ala Gly Gly Arg Thr Thr He Asn Asp Ala Thr He 
420 425 430 

Gin Ala Gly Ser Ser Val Tyr Ser Ser Thr Lys Gly Asp Thr Glu Leu 
435 440 445 

Gly Glu Asn Thr Arg He He Ala Glu Asn Val Thr Val Leu Ser Asn 
450 455 460 

Gly Ser He Gly Ser Ala Ala Val He Glu Ala Lys Asp Thr Ala His 
465 470 475 480 

lie Glu Ser Gly Lys Pro Leu Ser Leu Glu Thr Ser Thr Val Ala Ser 
485 490 495 

Asn He Arg Leu Asn Asn Gly Asn He Lys Gly Gly Lys Gin Leu Ala 
500 505 510 
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Leu Leu Ala Asp Asp Asn He Thr Ala Lys Thr Thr Asn Leu Asn Thr 
515 520 525 

Pro Gly Asn Leu Tyr Val His Thr Gly Lys Asp Leu Asn Leu Asn Val 
530 535 540 

Asp Lys Asp Leu Ser Ala Ala Ser He His Leu Lys Ser Asp Asn Ala 
545 550 555 560 

Ala His He Thr Gly Thr Ser Lys Thr Leu Thr Ala Ser Lys Asp Met 
565 570 575 

Gly Val Glu Ala Gly Leu Leu Asn Val Thr Asn Thr Asn Leu Arg Thr 
580 585 590 

Asn Ser Gly Asn Leu His He Gin Ala Ala Lys Gly Asn He Gin Leu 
595 600 605 

Arg Asn Thr Lys Leu Asn Ala Ala Lys Ala Leu Glu Thr Thr Ala Leu 
510 615 620 

Gin Gly Asn He Val Ser Asp Gly Leu His Ala Val Ser Ala Asp Gly 
625 630 635 640 

His Val Ser Leu Leu Ala Asn Gly Asn Ala Asp Phe Thr Gly His Asn 
645 650 655 

Thr Leu Thr Ala Lys Ala Asp Val Asn Ala Gly Ser Val Gly Lys Gly 
660 665 670 

Arg Leu Lys Ala Asp Asn Thr Asn He Thr Ser Ser Ser Gly Asp He 
675 680 685 

Thr Leu Val Ala Gly Asn Gly He Gin Leu Gly Asp Gly Lys Gin Arg 
690 695 700 

Asn Ser He Asn Gly Lys His He Ser He Lys Asn Asn Gly Gly Asn 
705 710 715 720 
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Ala Asp Leu Lys Asn Leu Asn Val His Ala Lys Ser Gly Ala Leu Asn 
725 730 735 

lie His Ser Asp Arg Ala Leu Ser lie Glu Asn Thr Lys Leu Glu Ser 
740 745 750 

Thr His Asn Thr His Leu Asn Ala Gin His Glu Arg Val Thr Leu Asn 
755 760 765 

Gin Val Asp Ala Tyr Ala His Arg His Leu Ser lie Thr Gly Ser Gin 
770 775 780 

lie Trp Gin Asn Asp Lys Leu Pro Ser Ala Asn Lys Leu Val Ala Asn 
785 790 795 800 

Gly Val Leu Ala Leu Asn Ala Arg Tyr Ser Gin He Ala Asp Asn Thr 
805 810 815 

Thr Leu Arg Ala Gly Ala He Asn Leu Thr Ala Gly Thr Ala Leu Val 
820 825 830 

Lys Arg Gly Asn He Asn Trp Ser Thr Val Ser Thr Lys Thr Leu Glu 
835 840 845 

Asp Asn Ala Glu Leu Lys Pro Leu Ala Gly Arg Leu Asn He Glu Ala 
850 855 860 

Gly Ser Gly Thr Leu Thr He Glu Pro Ala Asn Arg He Ser Ala His 
865 870 875 880 

Thr Asp Leu Ser He Lys Thr Gly Gly Lys Leu Leu Leu Ser Ala Lys 
885 890 895 

Gly Gly Asn Ala Gly Ala Pro Ser Ala Gin Val Ser Ser Leu Glu Ala 
900 905 910 
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Lys Gly Asn He Arg Leu Val Thr Gly Glu Thr Asp Leu Arg Gly Ser 
915 920 925 

Lys He Thr Ala Gly Lys Asn Leu Val Val Ala Thr Thr Lys Gly Lys 
930 935 940 

Leu Asn He Glu Ala Val Asn Asn Ser Phe Ser Asn Tyr Phe Pro Thr 
945 950 955 960 

Gin Lys Ala Ala Glu Leu Asn Gin Lys Ser Lys Glu Leu Glu Gin Gin 
965 970 975 

He Ala Gin Leu Lys Lys Ser Ser Pro Lys Ser Lys Leu He Pro Thr 
980 985 990 

Leu Gin Glu Glu Arg Asp Arg Leu Ala Phe Tyr He Gin Ala He Asn 
995 1000 1005 

Lys Glu Val Lys Gly Lys Lys Pro Lys Gly Lys Glu Tyr Leu Gin Ala 
1010 1015 1020 

Lys Leu Ser Ala Gin Asn He Asp Leu He Ser Ala Gin Gly He Glu 
1025 1030 1035 1040 

He Ser Gly Ser Asp He Thr Ala Ser Lys Lys Leu Asn Leu His Ala 
1045 1050 1055 

Ala Gly Val Leu Pro Lys Ala Ala Asp Ser Glu Ala Ala Ala He Leu 
1060 1065 1070 

He Asp Gly He Thr Asp Gin Tyr Glu He Gly Lys Pro Thr Tyr Lys 
1075 1080 1085 

Ser His Tyr Asp Lys Ala Ala Leu Asn Lys Pro Ser Arg Leu Thr Gly 
1090 1095 1100 

Arg Thr Gly Val Ser He His Ala Ala Ala Ala Leu Asp Asp Ala Arg 
H05 mo 1U5 1120 
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lie lie He Gly Ala Ser Glu He Lys Ala Pro Ser Gly Ser lie Asp 
1125 H30 H35 

lie Lys Ala His Ser Asp He Val Leu Glu Ala Gly Gin Asn Asp Ala 
1140 H45 use 

Tyr Thr Phe Leu Lys Thr Lys Gly Lys Ser Gly Lys He He Arg Lys 
1155 1160 H65 

Thr Lys Phe Thr Ser Thr Arg Asp His Leu He Met Pro Ala Pro Val 
H70 H75 iiso 

Glu Leu Thr Ala Asn Gly He Thr Leu Gin Ala Gly Gly Asn He Glu 
1185 1190 H95 1200 

Ala Asn Thr Thr Arg Phe Asn Ala Pro Ala Gly Lys Val Thr Leu Val 
1205 1210 1215 

Ala Gly Glu Glu Leu Gin Leu Leu Ala Glu Glu Gly He His Lys His 
1220 1225 1230 

Glu Leu Asp Val Gin Lys Ser Arg Arg Phe He Gly He Lys Val Gly 
1235 1240 1245 

Lys Ser Asn Tyr Ser Lys Asn Glu Leu Asn Glu Thr Lys Leu Pro Val 
1250 1255 1260 

Arg Val Val Ala Gin Thr Ala Ala Thr Arg Ser Gly Trp Asp Thr Val 
1265 1270 1275 1280 

Leu Glu Gly Thr Glu Phe Lys Thr Thr Leu Ala Gly Ala Asp He Gin 
1285 1290 1295 

Ala Gly Val Gly Glu Lys Ala Arg Val Asp Ala Lys He He Leu Lys 
1300 1305 1310 
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Gly He Val Asn Arg He Gin Ser Glu Glu Lys Leu Glu Thr Asn Ser 
1315 1320 1325 

Thr Val Trp Gin Lys Gin Ala Gly Arg Gly Ser Thr He Glu Thr Leu 
1330 1335 1340 

Lys Leu Pro Ser Phe Glu Ser Pro Thr Pro Pro Lys Leu Ser Ala Pro 
134 $ 1350 1355 1360 

Gly Gly Tyr He Val Asp He Pro Lys Gly Asn Leu Lys Thr Glu He 
1365 1370 1375 

Glu Lys Leu Ser Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
1380 1385 1390 

Val Ala Lys Asn He Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Arg 
1395 1400 1405 

Trp Asp Tyr Lys Gin Glu Gly Leu Thr Glu Ala Gly Ala Ala He He 
1410 1415 1420 

Ala Leu Ala Val Thr Val Val Thr Ser Gly Ala Gly Thr Gly Ala Val 
1425 1430 1435 1440 

Leu Gly Leu Asn Gly Ala Ala Ala Ala Ala Thr Asp Ala Ala Phe Ala 
1445 1450 1455 

Ser Leu Ala Ser Gin Ala Ser Val Ser Phe He Asn Asn Lys Gly Asp 
1460 1465 1470 

Val Gly Lys Thr Leu Lys Glu Leu Gly Arg Ser Ser Thr Val Lys Asn 
1475 1480 1485 

Leu Val Val Ala Ala Ala Thr Ala Gly Val Ala Asp Lys He Gly Ala 
1490 1495 1500 

Ser Ala Leu Asn Asn Val Ser Asp Lys Gin Trp He Asn Asn Leu Thr 
1505 1510 1515 1520 
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Val Asn Leu Ala Asn Ala Gly Ser Ala Ala Leu lie Asn Thr Ala Val 
1525 1530 1535 

Asn Gly Gly Ser Leu Lys Asp Asn Leu Glu Ala Asn lie Leu Ala Ala 
1540 1545 1550 

Leu Val Asn Thr Ala His Gly Glu Ala Ala Ser Lys lie Lys Gin Leu 
1555 1560 1565 

Asp Gin His Tyr lie Val His Lys lie Ala His Ala lie Ala Gly Cys 
1570 1575 1580 

Ala Ala Ala Ala Ala Asn Lys Gly Lys Cys Gin Asp Gly Ala lie Gly 
1585 1590 1595 1600 

Ala Ala Val Gly Glu lie Val Gly Glu Ala Leu Thr Asn Gly Lys Asn 
1605 1610 1615 

Pro Asp Thr Leu Thr Ala Lys Glu Arg Glu Gin lie Leu Ala Tyr Ser 
1620 1625 1630 

Lys Leu Val Ala Gly Thr Val Ser Gly Val Val Gly Gly Asp Val Asn 
1635 1640 1645 

Ala Ala Ala' Asn Ala Ala Glu Val Ala Val Lys Asn Asn Gin Leu Ser 
1650 1655 1660 

Asp Lys Glu Gly Arg Glu Phe Asp Asn Glu Met Thr Ala Cys Ala Lys 
1665 1670 1675 1680 

Gin Asn Asn Pro Gin Leu Cys Arg Lys Asn Thr Val Lys Lys Tyr Gin 
1685 1690 1695 

Asn Val Ala Asp Lys Arg Leu Ala Ala Ser He Ala He Cys Thr Asp 
1700 1705 1710 
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lie Ser Arg Ser Thr Glu Cys Arg Thr lie Arg Lys Gin His Leu lie 
1715 1720 1725 

Asp Ser Arg Ser Leu His Ser Ser Trp Glu Ala Gly Leu lie Gly Lys 
1730 1735 1740 

Asp Asp Glu Trp Tyr Lys Leu Phe Ser Lys Ser Tyr Thr Gin Ala Asp 
1745 1750 1755 1760 

Leu Ala Leu Gin Ser Tyr His Leu Asn Thr Ala Ala Lys Ser Trp Leu 
1765 1770 1775 

Gin Ser Gly Asn Thr Lys Pro Leu Ser Glu Trp Met Ser Asp Gin Gly 
1780 1785 1790 

Tyr Thr Leu lie Ser Gly Val Asn Pro Arg Phe lie Pro lie Pro Arg 
1795 1800 1805 

Gly Phe Val Lys Gin Asn Thr Pro lie Thr Asn Val Lys Tyr Pro Glu 
1810 1815 1820 

Gly lie Ser Phe Asp Thr Asn Leu Lys Arg His Leu Ala Asn Ala Asp 
1825 1830 1835 1840 

Gly Phe Ser Gin Glu Gin Gly He Lys Gly Ala His Asn Arg Thr Asn 
1845 1850 1855 

Phe Met Ala Glu Leu Asn Ser Arg Gly Gly Arg Val Lys Ser Glu Thr 
1860 1865 1870 

Gin Thr Asp He Glu Gly He Thr Arg He Lys Tyr Glu He Pro Thr 
1875 1880 1885 

Leu Asp Arg Thr Gly Lys Pro Asp Gly Gly Phe Lys Glu He Ser Ser 
1890 1895 1900 

He Lys Thr Val Tyr Asn Pro Lys Lys Phe Ser Asp Asp Lys He Leu 
1905 1910 1915 1920 
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Gin Met Ala Gin Asn Ala Ala Ser Gin Gly Tyr Ser Lys Ala Ser Lys 
1925 1930 1935 

lie Ala Gin Asn Glu Arg Thr Lys Ser lie Ser Glu Arg Lys Asn Val 
1940 1945 1950 

He Gin Phe Ser Glu Thr Phe Asp Gly He Lys Phe Arg Ser Tyr Phe 
1955 1960 1965 

Asp Val Asn Thr Gly Arg He Thr Asn He His Pro Glu 
1970 1975 1980 

INFORMATION FOR SEQ ID NO : 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..143 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 

Met Lys Asn Asn He Phe Leu Asn Leu Asn Lys Lys Ser He Asn Asn 
15 10 15 

Asn His Phe Val He Ser He Phe Phe Glu Thr He Tyr Gin Phe Glu 
20 25 30 
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Thr Lys Asp Thr Leu Leu Glu Cys Phe Lys Asn lie Thr Thr Thr Gly 
35 40 45 

His Phe Gly Val lie Gly Ala Gin Tyr Glu Lys He Asp Ala Thr Arg 
50 55 60 

Trp He Gly Asp Tyr Glu Glu Val Asn Gly Phe Glu Tyr He Asp Lys 
65 70 75 80 

Ala Pro Ser He Tyr Phe Ser Val Gly Asp Asp Phe Asn Pro Glu Glu 
85 90 95 

Leu He He Pro He Asn Leu Ala Tyr His Tyr Phe Asn He Ala He 
100 105 no 

Ser Asp Phe Leu He Ala His Pro Glu Tyr Gin Lys Lys Cys Lys Glu 
115 120 125 

He Gin Lys Thr Tyr Ser Gin Thr Asn Cys Ser Leu His Glu Thr 
130 135 140 

INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME / KEY : Peptide 

(B) LOCATION: 1..833 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



REPLACEMENT SHEET (RULE 26) 



100 



Val Leu Lys Thr Pro Pro Thr Leu Ala Ala Glu Leu Ser Gly Lys Thr 
15 10 15 

Gly Val Ser lie Ser Ala Pro Tyr Ala Asn Glu Asn Ser Arg He Leu 
20 25 30 

Leu Ser Thr Thr Asp He Ser Ser Glu Asn Gly Lys He Lys He Gin 
35 40 45 

Ser Tyr Gly Asp Gin Tyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 
50 55 60 

Phe Glu Arg Arg Ser Tyr Lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
65 70 75 80 

He Thr Glu Val Lys Glu His Lys Asn Ala Lys Pro Asp Ala Val Asn 
85 90 95 

Leu Ser Ala Ser Gin Gly lie Asp lie Lys Ser Gly Gly Ser He Asp 
100 105 110 

Ala Tyr Ala Thr Ala Phe Asp Ala Pro Lys Gly Ser He Asn He Glu 
115 120 125 

Ala Gly Arg Lys Leu Thr Leu Tyr Ala Val Glu Glu Leu Asn Tyr Asp 
130 135 140 

Lys Leu Asp Ser Gin Lys Arg Arg Arg Phe Leu Gly He Ser Tyr Ser 
145 150 155 160 

Lys Ala His Asp Thr Thr Thr Gin Val Met Lys Thr Ala Leu Pro Ser 
165 170 175 

Arg Val Val Ala Glu Ser Ala Asn Leu Gin Ser Gly Trp Asp Thr Lys 
180 185 190 
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Leu Gin Gly Thr Gin Phe Glu Thr Thr Leu Gly Gly Ala Thr lie Arg 
195 200 205 

Ala Gly Val Gly Glu Gin Ala Arg Ala Asp Ala Lys lie lie Leu Glu 
210 215 220 

Gly lie Lys Ser Ser Xle His Thr Glu Thr Val Ser Ser Ser Lys Ser 
225 230 235 240 

Thr Leu Trp Gin Lys Gin Ala Gly Arg Gly Ser Asn lie Glu Thr Leu 
245 250 255 

Gin Leu Pro Ser Phe Thr Gly Pro Val Ala Pro Val Leu Ser Ala Pro 
260 265 270 

Gly Gly Tyr lie Val Asp lie Pro Lys Gly Asn Leu Lys Thr Gin lie 
275 280 285 

Glu Thr Leu Thr Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
290 295 300 

Val Ala Lys Asn lie Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 
305 310 315 320 

Trp Asp Tyr Lys Gin Glu Gly Met Thr Pro Ala Ala Ala Ala Val Val 
325 330 335 

Val lie Val Val Thr Val Leu Thr Tyr Gly Ala Leu Ser Ala Pro Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 
355 360 365 

Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala Ala Thr 
370 375 380 

Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala lie Thr Thr Ala Ala 
385 390 395 400 
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Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 
405 410 415 

lie Asn Asn Lys Gly Asp lie Asn His Thr Leu Lys Glu Leu Gly Lys 
420 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 

Leu Gin Gly He Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 
450 455 460 

Lys His Phe His Ser Pro Ala Ala Gly Lys Leu Thr Ala Asn Leu He 
465 470 475 480 

Asn Ser Thr Ala Ala Ala Ser Val His Thr Ala He Asn Gly Gly Ser 
485 490 495 

Leu Lys Asp Asn Leu Gly Asp Ala Ala Leu Gly Ala He Val Ser Thr 
500 505 510 

Val His Gly Glu Val Ala Ser Lys He Lys Phe Asn Leu Ser Glu Asp 
515 520 525 

Tyr He Ala His Lys He Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala He Gly Ala Ala Val 
545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 
565 570 575 

Leu Ser Pro Gin Glu Arg Gin Lys Val He Ala Tyr Ser Gin He He 
580 585 590 
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Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 
595 600 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 
610 615 620 

Arg Val Asn lie Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
625 630 635 640 

Ala He Leu Glu He Gin Ala He Thr Asn Gin He Arg Arg Leu Asp 
645 650 655 

Pro Lys Phe Asn Gly He Ala He Leu Arg Thr Pro Gly Glu Pro Trp 
660 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 
675 680 685 

Glu Ser Arg Gly Phe Ala Val Glu Pro He Tyr Arg He Arg He Asn 
690 695 700 

Asn Gly Asn Glu Phe Asn Arg He Met Ser Ser Lys Tyr Pro Tyr Asn 
705 710 715 720 

Glu Leu Tyr Val Ala Asn Pro Lys Ser Ala Thr Gly Tyr Phe Arg Val 
725 730 735 

Asp Ser Tyr Asp Pro Ala Thr Arg Glu He He Ser Arg Lys Phe Thr 
740 745 750 

Gin Phe Ser Gin He Gin Glu Ser Thr Gly He Gly Tyr He Lys Glu 
755 760 765 

Ala Val Arg Lys Tyr Ser Pro Gly Thr Val He Ser Asn Val Pro Ser 
770 775 780 

Thr Pro Thr Thr He Arg Gly Arg Lys Leu Glu Gly Lys Leu He Leu 
785 790 795 800 
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Glu Val Pro Ala 

Ala Gin Glu Glu 
820 



Gin Val Asn Pro 
805 

Asn Val lie lie 



lie Pro Gin Ser 
810 

Arg Asp Thr Thr 
825 



Val Leu Arg Ala 
815 

Gly Arg lie Tyr 
830 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Val Leu Lys Thr Pro Pro Thr Leu Ala Ala Glu Leu Ser Gly Lys Thr 
1 5 10 15 

Gly Val Ser lie Ser Ala Pro Tyr Ala Asn Glu Asn Ser Arg lie Leu 
20 25 30 

Leu Ser Thr Thr Asp lie Ser Ser Glu Asn Gly Lys He Lys He Gin 
35 40 45 

Ser Tyr Gly Asp Gin Tyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 
50 55 60 

Phe Glu Arg Arg Ser Tyr Lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
65 70 75 80 

He Thr Glu Val Lys Glu His Lys Asn Ala Lys Pro Asp Ala Val Asn 
85 90 95 
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Leu Ser Ala Ser Gin Gly He Asp He Lys Ser Gly Gly Ser He Asp 
100 105 110 

Ala Tyr Ala Thr Ala Phe Asp Ala Pro Lys Gly Ser lie Asn He Glu 
115 120 125 

Ala Gly Arg Lys Leu Thr Leu Tyr Ala Val Glu Glu Leu Asn Tyr Asp 
130 135 140 

Lys Leu Asp Ser Gin Lys Arg Arg Arg Phe Leu Gly He Ser Tyr Ser 
145 150 155 160 

Lys Ala His Asp Thr Thr Thr Gin Val Met Lys Thr Ala Leu Pro Ser 
165 170 175 

Arg Val Val Ala Glu Ser Ala Asn Leu Gin Ser Gly Trp Asp Thr Lys 
180 185 190 

Leu Gin Gly Thr Gin Phe Glu Thr Thr Leu Gly Gly Ala Thr He Arg 
195 200 205 

Ala Gly Val Gly Glu Gin Ala Arg Ala Asp Ala Lys He He Leu Glu 
210 215 220 

Gly He Lys Ser Ser He His Thr Glu Thr Val Ser Ser Ser Lys Ser 
225 230 235 240 

Thr Leu Trp Gin Lys Gin Ala Gly Arg Gly Ser Asn He Glu Thr Leu 
245 250 255 

Gin Leu Pro Ser Phe Thr Gly Pro Val Ala Pro Val Leu Ser Ala Pro 
260 265 270 

Gly Gly Tyr He Val Asp He Pro Lys Gly Asn Leu Lys Thr Gin He 
275 280 285 
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Glu Thr Leu Thr Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
290 295 300 

Val Ala Lys Asn lie Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 
305 310 315 320 

Trp Asp Tyr Lys Gin Glu Gly Met Thr Pro Ala Ala Ala Ala Val Val 
325 330 335 

Val lie Val Val Thr Val Leu Thr Tyr Gly Ala Leu Ser Ala Pro Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 
355 360 365 

Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala Ala Thr 
370 375 380 

Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala He Thr Thr Ala Ala 
385 390 395 400 

Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 
405 410 415 

He Asn Asn Lys Gly Asp He Asn His Thr Leu Lys Glu Leu Gly Lys 
420 - 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 

Leu Gin Gly He Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 
450 455 460 

Lys His Phe His Ser Pro Ala Ala Gly Lys Leu Thr Ala Asn Leu He 
465 470 475 480 

Asn Ser Thr Ala Ala Ala Ser Val His Thr Ala He Asn Gly Gly Ser 
485 490 495 
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Leu Lys Asp Asn Leu Gly Asp Ala Ala Leu Gly Ala lie Val Ser Thr 
500 505 510 

Val His Gly Glu Val Ala Ser Lys lie Lys Phe Asn Leu Ser Glu Asp 
515 520 525 

Tyr lie Ala His Lys He Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala He Gly Ala Ala Val 
545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 
565 570 575 

Leu Ser Pro Gin Glu Arg Gin Lys Val He Ala Tyr Ser Gin He He 
580 585 590 

Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 
595 600 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 
610 615 620 

Arg Val Asn He Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
625 630 635 640 

Ala He Leu Glu He Gin Ala He Thr Asn Gin He Arg Arg Leu Asp 
645 650 655 

Pro Lys Phe Asn Gly He Ala He Leu Arg Thr Pro Gly Glu Pro Trp 
660 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 
675 680 685 



REPLACEMENT SHEET (RULE 26) 



108 

Glu Ser Arg Gly Phe Ala Val Glu Pro lie Tyr Arg lie Arg lie Asn 
690 695 700 

Asn Gly Asn Glu Phe Asn Arg lie Met Ser Ser Lys Tyr Pro Tyr Asn 
705 710 715 720 

Glu Leu Tyr Val Ala Asn Pro Lys Ser Ala Thr Gly Tyr Phe Arg Val 
725 730 735 

Asp Ser Tyr Asp Pro Ala Thr Arg Glu lie lie Ser Arg Lys Phe Thr 
740 745 750 

Gin Phe Ser Gin lie Gin Glu Ser Thr Gly lie Gly Tyr lie Lys Glu 
755 760 765 

Ala Val Arg Lys Tyr Ser Pro Gly Thr Val lie Ser Asn Val Pro Ser 
770 775 780 

Thr Pro Thr Thr lie Arg Gly Arg Lys Leu Glu Gly Lys Leu lie Leu 
785 790 795 800 

Glu Val Pro Ala Gin Val Asn Pro lie Pro Gin Ser Val Leu Arg Ala 
805 810 815 

Ala Gin Glu Glu Asn Val lie lie Arg Asp Thr Thr Gly Arg He Tyr 
820 825 830 

Lys 

(2) INFORMATION FOR SEQ ID NO : 42: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

{ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Asp He Phe Tyr Cys Glu Gin Trp Ser Tyr Gly Tyr Lys 
15 10 15 

Arg Leu His Lys Pro Phe Ser Glu Lys Gin Ala Glu Glu Lys His Leu 
20 25 30 

Lys Gly Glu Leu Tyr Thr Ala Val He Gly Ser Ala Thr Gin Pro Glu 
35 40 45 

Tyr Val He Thr Leu Arg Glu Glu Val Gly Phe Phe Ser Val Asn Phe 
50 55 60 

Phe Asp Lys Phe Gly Arg Asp Tyr Leu Thr His Gin Phe Gin Lys Tyr 
65 70 75 80 

Ser Asn Ser- Asn Tyr Tyr Phe Leu Ser Met Ala Val Trp Arg Asp Tyr 
85 90 95 

He Thr Leu Glu Ser His Asp Leu Ala Glu Gly Tyr Thr Tyr Phe Phe 
100 105 110 

Asn Glu Asn Thr Asp Asp Cys Tyr Val Leu Lys Gin Asp Phe He Asn 
115 120 125 

Asn Glu Arg Tyr Glu Lys Thr Glu Leu Tyr Ser Gin Lys Asp Lys Val 
130 135 140 
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lie Leu Phe Pro Lys Phe Gly Glu Tyr Asp Leu Val Leu Asn Pro Asp 
145 150 155 160 

lie lie 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide , . 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 
<B) LOCATION: 1. . 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Asn Lys Arg Met Lys Met Cys Pro Ala Cys Gin Gin Gly Tyr Leu 
1-5 10 15 

Tyr His Ser Lys Pro Lys Tyr Leu His Asp Glu lie lie Leu Cys Asp 
20 25 30 

Glu Cys Asp Ala Val Trp Leu Lys Gly Met Asn lie Phe Tyr Gly Glu 
35 40 45 

Tyr Glu Lys Asp Phe Tyr Ser Tyr Val Pro Phe Met Glu Ser Gin Gly 
50 55 60 

lie Thr Ser Glu Cys lie Trp Glu Gly Asp Leu Phe Asp His Pro Tyr 
65 70 75 80 
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Tyr Glu Asp Glu Asn Ser Asn Asp Met Asp 
85 90 

INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 

<ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Ser Ala Thr Glu He Glu Lys Ala Lys Ala Lys He Thr Ala Tyr 
15 10 15 

Ser Lys Leu Val Ala Gly Thr Ala Ser Ala Val Val Gly Gly Asp Val 
20 25 30 

Asn Thr Ala Ala Asn Ala Ala Gin He Ala Val Glu Asn Asn Thr Leu 
35 40 45 

Tyr Pro Arg Cys Val Gly Ala Lys Cys Asp Glu Phe Gin Lys Glu Gin 
50 55 60 

Gin Lys Trp He Arg Glu Asn Pro Glu Glu Tyr Arg Glu Val Leu Leu 
65 70 75 80 
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Phe Gin Thr Gly Phe He Pro He He Gly Asp He Gin Ser Phe Val 
85 90 95 

Gin Ala Gin Thr Ala Ala Asp His Leu Phe Ala Leu Leu Gly Val Val 
100 105 110 

Pro Gly He Gly Glu Ser He Gin Ala Tyr Lys Val Ala Lys Ala Ala 
115 120 125 

Lys Asn Leu Gin Gly Met Lys Lys Ala Leu Asp Lys Ala Ala Thr Val 
130 135 140 

Ala Thr Ala Gin Gly Tyr Val Ser Lys Thr Lys He Lys He Gly Gin 
145 150 155 160 

Thr Glu Leu Arg Val Thr Ala Ala Thr Asp Lys Gin Leu Leu Lys Ala 
165 170 175 

He Gly Glu Gly Arg Asp Thr Thr Gly Lys Met Thr Glu Gin Leu Phe 
180 185 190 

Asp Ser Leu Ala Lys Gin Asn Gly Phe Arg Val Leu Ser Gly Gly Lys 
195 200 205 

Tyr Gly Gly Asn Asn Gly Phe Asp His Val Trp Gin Ala Ala Asp Gly 
210 215 220 

Ser Val Val Leu He Val Glu Ser Lys Gin He Arg Asn Gly Thr Val 
225 230 235 240 

Gin Leu Asn Pro Asn Gly Ala Gly Gly Tyr Thr Gin Met Ser Glu Asp 
245 250 255 

Trp He Arg Gin Val Leu Asp Gin Leu Pro Asp Gly Ser Pro Ala Lys 
260 265 270 

Ala Ala Val Phe Lys Ala Asn Lys Asn Gly Thr Leu Lys Thr Ala He 
275 280 285 
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Ala Gly Val Asp Arg Gin Thr Gly Lys Ala Val lie Leu Pro Val Lys 
290 295 300 

Val Pro Ser Lys Thr Asn lie Arg Arg 
305 310 

INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

( A) NAME / KEY : Pep t ide 

(B) LOCATION: 1..311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Gly His Asn Met Met Thr Thr Gin Lys Trp Tyr Glu His lie Thr 
15 10 15 

Asn Val lie lie Gly Asn Thr Ala Asn Phe Asn Ser Gly Cys Leu Asp 
20 25 30 

Ser lie Asp Tyr Val Asp Glu Arg Lys Gly Val Pro Leu Ala Ala Met 
35 40 45 

Gin His lie Phe Met Asp Val Arg Ala Ala Ala Ser His Ala Tyr Leu 
50 55 60 
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Phe Glu His Asp Leu Lys Lys Phe Lys Gin Tyr Ala Tyr Val Ala Gly 
65 70 75 80 

Lys Leu Gly Val Leu Leu Ser Val Asn Ser Thr Asp Pro Glu Pro Phe 
85 90 95 

Phe Phe Pro Cys Asp Met Leu Asn lie Gin Asn Pro Met Phe Leu Met 
100 105 110 

Leu Met Ser Asp Ser Pro Gin Leu Arg Glu Phe Leu Val Arg Asn lie 
115 120 125 

Asp Asn lie Ala Asn Asp Thr Glu Ala Phe lie Asn Arg Tyr Asp Leu 
130 135 140 

Asn Arg His Met lie Tyr Asn Thr Leu Leu Met Val Glu Gly Lys Gin 
145 150 155 160 

Leu Asp Arg Leu Lys Gin Arg Ser Glu Lys Val Leu Ala His Pro Thr 
165 170 175 

Pro Ser Lys Trp Leu Gin Lys Arg Leu Tyr Asp Tyr Arg Phe Phe Leu 
180 185 190 

Ala Phe Ala Glu Gin Asp Ala Glu Ala Met Lys Ala Ala Leu Glu Pro 
195 200 205 

Leu Phe Asp Lys Lys Thr Ala Arg Met Ala Ala Lys Glu Thr Leu Ser 
210 215 220 

Tyr Phe Asp Phe Tyr Leu Gin Pro Gin lie Val Thr Tyr Ala Lys lie 
225 230 235 240 

Ala Ser Met His Gly Phe Asp Leu Gly lie Asp Gin Glu lie Ser Pro 
245 250 255 

Arg Asp Leu lie Val Tyr Asp Pro Leu Pro Ala Asp Glu Tyr Gin Asp 
260 265 270 
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lie Phe Asp Phe Met Lys Gin Tyr Asp Leu Ser Tyr Pro Tyr Glu Tyr 
275 280 285 

Leu Gin Asp Trp lie Asp Tyr Tyr Thr Phe Lys Thr Asp Lys Leu Val 
290 295 300 

Phe Gly Asn Ala Lys Arg Glu 
305 310 

(2) INFORMATION FOR SEQ ID NO: 46: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCCACCGGTA CGGAAACTGA A 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCTGAATTCA TGTCTATTCC ATTTTGAAGA 3 0 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CCGAGATCTT TAACCCTTTG GGCTTAAGCG A 31 
(2) INFORMATION FOR SEQ ID NO : 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGGAGATCTC CCGCTCGTGT TGTGCATTA 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50: 
AAGAGATCTG CAGCCAAGGC TCTCGAAA 
(2) INFORMATION FOR SEQ ID NO : 51: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGGAGATCTC AGGCTGCCGC CGTTGA 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
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GGGAGATCTC AC CCCAAGAA CGCCAAAA _ 2 8 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGAGATCTG AACGTATAGT AATCTATCCA A 31 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AGTGGCTCCT AG 12 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
AGCACTCTCC AGCCTCTCAC CGAG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
AGTGGCTCTT AA 

(2) INFORMATION FOR SEQ ID NO: 57: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
AGTGGCTGGC 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

AGCACTCTCC AGCCTCTCAC CGAC 

(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
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(B) TYPE: nucleotide 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GTACTTGCCT AG 

(2) INFORMATION FOR SEQ ID NO : 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ACCGACGTCG ACTATCCATG AACG 
(2) INFORMATION FOR SEQ ID NO : 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 61: 
GTACTTGCTT AA 

(2) INFORMATION FOR SEQ ID NO : 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GTACTTGGGC 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
ACCGACGTCG ACTATCCATG AACC 
(2) INFORMATION FOR SEQ ID NO: 64 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
AATTCTCCCT CG 

(2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGGCAACTGT GCTATCCGAG GGAG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GATCAACTTT TCCCTGTTTG TCCCATTACC GGTTTGAATG AACCGATTGC GCGCCGCGCG 60 
TGTTGTTGGA CATTACCTGC GATTCAGACG GTACGATTGA CCACTACATC GAGGAGAACG 120 
GCAATCAGGG TACAATGCTA 140 
(2) INFORMATION FOR SEQ ID NO : 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 67: 
GATCCGCGTA CTTGGTTTTT CATATTTTGC ATAGTCTTGT CGGTCGGGCA TCTTCCCCGA 60 
CATCATCTAA ATTTGTCTTT ATTGGTTTTT ACGCCACTCA TTGCGGATAA ACAATATTCC 120 
GCCTTGCCGT CGCGAATGTT CAAGCTAGCC TGCATCACCG TAATCAGGTT GCCCGTTACC 180 
GAGCCTTCGA GA 192 
(2) INFORMATION FOR SEQ ID NO : 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GATCCGGCTG CCCGACGCGC GCAAAATTGC CGCCGAGGAA AGCGCGCACA ACCACGACGG 60 
CAAAACCAGC GTATGGCAAT ACAAACATCT CGTGTTCGGT ACGGCAGGCA TTTTCTGCTA 12 0 

TGTCGGCGCG GAGGTGTCTA TCGGTTCGTT GATGGTCAAC GTATTGGGTT AT CTGAAAGG 180 
GCTGGATC I 88 
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(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 69: 
GATCCCCCAC TTTACCTCGG GCAGATTTTG CGCGTTCATT ACAATAGCGT ATTTATGCGT 60 
TTGCGTTTGC GCTTGCCGCT GCCCCCCCCC CGCCGGTATG GGAAAACATC AATATGGCGG 120 
TATAAAGCGC GGTATGGCGG AAAACCTGCC GTTTCCAAGT TTTATTCATC TTTTATTCCT 180 
TGAGTTTGCC TTCACGGGAC GGGGCGGCGC GCGGAACGCG GGGTTCGGTA AACCGCCCGA 240 
TTCCGCGCCC GCCGAATTGC TGATTGAAAA GCTTACTTCC CCATTTTAAC TTTGCACACT 300 
GATC 3 04 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



GATCAGACCC ATTTTCAGCG CACCGTAAGC GCGGATTTTC TCGAATTTTT CCAAAGCTGC 



60 



GGCATCGTTG TTGATGTCGT CTTGCAACTC TTTGCCCGTG TAGCCCAAGT CGGCGGCATT 



120 



CAGGAAAACG GTCGGAATGC CCGCGTTGAT GAGCGTGGCT TTCAAACGGC CTATATTCGG 



180 



CACATCAATT TCATCGACCA AATTGCCGGT TGGGAACATA CTGCCTTCGC CGTCGGCTGG 



240 



ATC 



243 



(2) INFORMATION FOR SEQ ID NO : 71 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CGGCGGCGTAGTccgccGcgACAGCGTTACCATAAGCGGGACAGACTACACCCCTTTATCTAACCCGC 
AAAGTTTGGATACGGAATTAAAATGGTTGCTTCAAGAAGCTCCCGAAATAGAAAATCCTTTCGACCGC 
GCCGTTTATCTCCATAATAATTTGGCGTATCTTCAATATTTTAAAGATTGCAATAAACGTACTGCCAG 
AAACTGCATGACCTTGTCGCTGATGCGCTCCG 
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(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGGTCAATCA CAAGAAAGTC AGCCGTCTGA TGGCGAAGAC GGGGCTGAAG GCAGTGATAT 60 
GGCGGCGCAA ATACCGCTCG TTCAAAGGAG AAGTCGGCAA AATTGCGCCG AATATCCTGC 120 
GACGCTGTTT CCATGCAGAA AAGCCGAATG AGAAATGGGT AACGGACGTT GCCGAGTTCA 180 
ATGTAGGCGG AGAAAAGATA TACCTTTCTC CGATTATGGA TTTGTTTAAC GGGGAAATCG 240 
TCAGTTACCG TATTCAGACC CGCCCGACTT TCGATTTGGC 28 0 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 6 0 

ACTCCTTACC GAAGTCTTCT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 120 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 6 0 

ACTCCTTACC GAAGTCTTCT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 12 0 

(2) INFORMATION FOR SEQ ID NO : 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGGTGTTTTT CTTAACAATT CGCCGACTTC ATGGCGATAT TTAAGTGACA GTTGCTCCGC 60 
CCACGCAGTT GCGCCGAACT CAGCACCACG ACATTATACT GATTATGCAC ATCGGCAAGA 120 
TCAAACTGAC CTAT CGTAGT ATCGCAGACT GT 152 
(2) INFORMATION FOR SEQ ID NO: 76 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CGGGAGGTTTTGTGCATCCTGATACCGATCGGTTGTTGTTGCTCAAAGGACAGAAGGCCGCTGATAAA 
CGAGATTACCTGTTTGTCGCTATTGACGATTTTTATACTCTGCCATTTTGCCAGACAAAACCGCAGAC 
AGTGCTGCCAAGTTTCTGACCGAACATCTGGCCGACCCCTGCTTGTACCTGATTGAGTACGCTTACTC 
TGACAATGATAGGTAATATAAAGAGCCGTCCAACATGCTTTCGGTGCAGTTTGTTATGATAATGGGAT 
TGGTTGGAGGCTTGCCCGATTTGCTTGTCCGCAGACCAACGGTAAGGCGGAGCGGGTTATCCGTACCT 
TGATGGAGATGTGG C ATGAGGAACAGTCGTTTGACAGAC CG 

(2) INFORMATION FOR SEQ ID NO: 77 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 69 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CGGAGCATAA AATCGTTATT AAAGATAATG GTATAGGAAC GAGCTTCGAT GAAATCAATG 60 
ATTTTTATTT GAGAATCGGT CGGAACAGAA GGGAAGAAAA ACAAGCCTCC CCGTGCGGAA 12 0 

GAATTC CAAC GGGTAAAAAA GGCCTTGGTA AATTGGCATT ATTCGGGCTT GGCAACAAAA 18 0 

TTGAAATTTC TACTATCCAG GGAAACGAAA GGGTTACTTT TACTTTGGAT TATGCAGAGA 240 
TTCGAAGAAG CAAGGGTATT TATCAACCG 269 
(2) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 03 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 78: 

CGGATGAAAACGGC ATACGCgc C AAAGTATTTACGAAC AT CAaAGGCTTGAAGATACCGC ACAC CTAC 
ATAGAAACGGACGCGAAAAAGCTGC CGAAAT CGAC AGATGAGC AGCTTT CGGCGCATGAT ATGTACGA 
ATGGATAAAGAAGC C CGAAAATATCGGGTCTATTGTC ATTGTAGATGAAGCT CAAGACGTATGGC CG 

(2) INFORMATION FOR SEQ ID NO : 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANTI SENSE : NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
CGGTTTCAGG TTGTCGCGAA GGCTCGGTAA CGGGCAACCT GATTACGGGT GATGCAGGCA 60 
GCTTGAACAT TCGCGACGGC AAGGCGGAAT ATGTTTATCC GCAATGAGTG GCGTAAAAAC 12 0 

CAATAAAGAC AAATTTAGAT GATGTCGGGG AAGATGCCCG ACCGACAAGA CTATGCAAAA 180 
TATGAAAAAC CAAGTACGCG GATCAGGCAT GGATGCACGA TCCAATCCG 229 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 80: 



CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG TTTGGAAATA 



60 



TTGTGTATCG GGGGGGGGTA TTTGCTGACG TAAAAAACTA TAAACGCCGC GCAAAATATG 



120 



GCTGACTATA TTATTGACTT TGATTTTGTC CTGCGCGGTG ATGGATAAAA TCGCCAGCGA 



180 



TAAAGAATTT GCGAGAACCT GATGCCG 



207 



(2) INFORMATION FOR SEQ ID NO : 81 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CGGCAACGAT TTGAGCTATC GCGGTTACGA CATTCTGGAT TTGGCACAAA AATGCGAGTT 6 0 

TGAAGAAGTC GCCCACCTGC TGATTCACGG CCATCTGCCC AACAAATTCG AGCTGGCCGC 12 0 

TTATAAAACC AAGCTCAAAT CCATGCGCGG CCTGCCTATC CGTGTGATTA AAGTTTTGGA 180 
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AAGCCTGCCT GCACATACCC ATCCGATGGA CGTAATGCGT ACCG 224 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CGGGAACAGC CATTGCCCAC GCCCACGCCC CCCAAGAAAG ACGGAAACTA CTGCCTAAAT 60 
TTTCGGCAAT CAAGTTGACG ATTAAAGGGT TGGGGGCAGT TGCAGTAATA AACATAGCCG 120 
ACGAAATGGG ATTGGAATGA TAGTTGAC C A AAGCCAAATA TTTACCCATC TTGCCTTCTG 180 
TGCCTTTTGC GGGATTGGAG CCGTAACTGC CG 212 
(2) INFORMATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
CGGGAATTCT GAGCAGAATG AAAGAAAGCA GGCTTGATAA TTTCATAAAG TTATTGGAAG 60 
AAAAAGGATT TACCGTCCAT TTCGGTATTC ACAATACGGC TGATTACGGA ATTCCCCAAA 120 
GCCGTAAAAG ATTTACGTTA ATTGCAAACA GAATAACCAA AGAAAAGCTG GAACCAGTCA 180 
AGTATTCGGG CAAACGGCTT ACGGTAGCCG ATGTTTTGGG AATGGAAATG GCTTTCCCAA 240 
CATTATTGCA GGACACCAAG ACGAAACGGA TTTTATGCAT AGCTGTGCGG GAATTATCTG 3 00 

ATATCACTTG AACGATTGGC TTGATACCTA AAAACGGAGG AACCGTTGGC TTT 353 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
AATTCCGTAT CCAAACTTTG CGGGTTAGAT AAAGGGGTGT AGTCTGTCCC GCTTATGGTA 60 
ACGCTGTCGC GGCGGACTAC GCCCGGAGCC TTTTTCCAGT AAGTTTTCGG AAATCAGGCT 12 0 

GTGGGTGGTT TTTAAGAAAT CCAACCAGTC AAACGGCTCG GGGCTGTCCA AAC CGGAC AC 18 0 
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AGGTGCCGGT AACTTTCCCT CAGGTTGATT AACATTACGG CATCCGAATA TAACTTCCCG 240 

CCTGCGGTTT GCCCGAGTTT AAGCAATGCC TGCGTATCGT ATTGATTATA AAGTGTTTCC 300 

TTCCAATT 3 08 
(2) INFORMATION FOR SEQ ID NO : 85: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 85: 
AATTCGTGTG CCGCGTCGAC AAACCGCTGA CGTAGCGGAT GTCTCATGCC ACGTTTCAAA 60 
GCAGGTTGAT GGCGGTTAGC AACCCTCTGA TTTCACTGGG ATAT 104 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AATTGCGTAG AGTGGGCTTC AGCCACGTTT TTTCTTTTTC GGTCGTTGAT TGGTGGGCTG 60 
AACCACTTGT TTCGGAAATC CGTATCATG 8 9 

(2) INFORMATION FOR SEQ ID NO : 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 73 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

AATTTCCACC TATGCCCTAC GCAGCGATTA TCCGTGGTTT AC CCAAAGGG TGATTATGGC 60 

AAAAGCGCGG GGTTGAGCGA CCGCCTTTTG TTGCCGGCGT TCAAACGGGT TTTGATAGGA 12 0 

AATGCAGGCA CGAAGCCTCG GCTGATTGTG ATGCACCTGA TGGGTTCGCA CAGTGATTTT 180 

TGCACACGTT TGGATAAGGA TGCGCGGCGG TTTCAGTATC AAACTGAAAA AATATCCTGC 24 0 

TATGTTTCCA TCAATCGCGC AAACCGATAA ATT 273 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 70 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
AATTCTTCCG CACGGGGAGG CTTGTTTTTC TTCCCTTCTG TTCCGACCGA TTCTCAAATA 60 
AAAATCATTG ATTTCATCGA AGTTCATTCC TATACCATTA TCTTTAATAA CGATTTTATG 12 0 

CTCCGGTTTA TCGAATAACC TAACTTCCAC TTCCGTAGCA CATGCATCGT AGGCATTCGC 180 
TATCAACTCG GCAATCGCAG GAACAGTGTG CGAATACAAT CTTTACACCC AAATGTTCGA 240 
TTACGGTTGG CTCGAAACTC AATTTCAATT 2 70 

(2) INFORMATION FOR SEQ ID NO : 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 5 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
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AATTATGAAC ACACGCATCA TCGTTTCGGC TGCGTTCGTT GCGTTGGCAT TAGCAGGTTG GO 

CGGCTCAATC AATAATGTAA CCGTTTCCGA CCAGAAACTT CAGGAACGTG CCGCGTTTGC 120 

CTTGGGCGTC ACCAATGCCG TAAAAATCAG CAACCGCAGC AATGAAGGCA TACGCATCAA 180 

CTTTAC CGC A ACTGTGGGTA AGCGCGTGAC CAATGCTATG TTACCAGTGT AATCAGCACA 240 

ATCGGCGTTA CCACTTCCGA TGCAATT 267 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

AATTTTTATT TGGTTCGTAG TCATTTTGTG CAACTGAACG ATATTCGTTT TCATCATTGC 60 

TAACGTCTAG TGCCCATTGT GGCCCGTAAT AAGAGATTTC GTCTCCTTTT ACATGTTTGA 12 0 

CGCTGACGGC ATACTGGGGA TCGATGACGG ATAATGTACG TCTGTTGACA TCTGCAACGC 18 0 

TAAATCAATC ATCGGTATTG GATAATGCGT TGCCGATGTT TTGACTTGTA TGTT 234 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AATTCGGCCG GCTGTGTCAA ATAATGCGTT ACTTTGGCCG GGTCTTGTTC TTTGTAAGTG 60 
GTGGTCTTTT TTTGCGCGTT ATCCCCATCT GTTTGAGTGC ATAGCAAATG GTGGCTGCCG 12 0 

TACAATCAAA TGTTTGGCGT TCATGCAGAT AGGCATCATG GTGTTGCCCA ATATATTGAG 180 
CCGGTTTTTG CCTATCCGAT TTGACGGCAT TTAGACCGGT AACTTGATGT TTTAAGCTGC 24 0 

CTGTTTGTTT AAAGGCGAAT C C AC AAGTAA AGCGTGTTTC TTGACAGGTT AAACG 295 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
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AATTGTGTAT ATCAAGTAGG ATGGGCATTT ATGCCTGACC TACAAAACCA AAAACAACCT 6 0 

ACCACCCTTA ATCAACTCCA CAAACCCTCT TCAGACAACC TCGTTTTTTG AAAAACAATC 120 

TGTAAACAGA TAACTGCTGA AGAATACCGT TGCCGAGCCC CAAAACCCGT ACTGCAACTT 18 0 

TTATTGTGAA CTTCCCATTA TGAGAAAATC CCTTTTCGTC CTCTTTCTGT ATTCGTCCCT 240 
ACTTACTGCC AGCGAAATT 259 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
AATTGCACCA CGCGATGATG GGTACGCCTC TGTTGCCATT GCGACCGCCG CCGCCGTGCC 6 0 

CGGTACGCTG GTCAACCTTG CCGCGGCGGA ACGGGTAAAG AAGTGCGCTT CGGGCATCCT 12 0 

TCCGGTACAT TGCGCGTCGG TGCAGCGCCG AATGTCAGGA CGGACAATGG ACGGCCACCA 180 
AAGCGGTTAT GAGCCGCAGC GCACGCGTGA TGATGGAAGG TTGGGTCAGG GTGCCGGAAG 240 
ATTGTTTTTA AATTGGACGG CGAACCGGTC TATTCGTATT GGCGTTATAC CGCCGCAAAG 3 00 

GCAGACCTTG AAACTGGTGC GTGCCGTGCA GGGCATGTAC GGCTATGTGT GCGTGGCGGG 3 60 
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CGGATTTGAT GTGCGGAAT 3 79 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 08 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
AATTTGTTGG GCAGATGGCC GTGAATCAGC AGGTGGGCGA CTTCTTCAAA CTCGCATTTT 60 
TGTGC CAAAT CCAGAATGTC GTAACCGCGA TACGTCAAAT CGTTGCCGGT ACGCAACGGT 12 0 

ACACAAAGCG GTATTAC CGG CCGCAACGCC AGAAAGCGCA ACGGATTTTT AGGTTTGAGG 180 
GTCGGGGTTT GAGTAGTTTC AGTCATGGTA TTTCTCCTTT GTGTTTTTAT GGGTTTCGGG 24 0 

TTTTCAGACG ACCGATGCGG ATTTGTTGAA AGGCAGTCTG AAAGCGGTAA ATCATTTTTG 3 00 

AAACAATT 3 08 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



AATTCGGAGG AGCAGTACCG CCAAGCGTTG CTCGCCTATT CCGGCGGTGA TAAAACAGAC 



60 



GAGGGTATCC GCCTGATGCA ACAGAGCGAT TACGGCAACT TGTCCTACCA CATCCGTAAT 12 0 



AAAAACATGC TTTTCATTTT TTCGGCAAGC AATGACGCAC AAGCTCAGCC CAACACAACT 180 



GACCCTATTG CCATTTTATG AAAAAGACGC TCAAAAAGGC ATTATCACAG TTGCAGGCGT 24 0 



AGAC CGCAGT GGAGAAAAGT TCAATGGCTC CAACCATTGC GGAATT 



286 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



AATTTGGATA CGTTGGAAAA GGGATATTTG ATTGGGAATG GGATGAAGAT AAGCGTAGAT 



60 



GAGTTGGGGA AAAAAGTGTT AGAACATATC GGTAAGAATG AACCGTTATT GTTGAAAAAT 12 0 
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CTACTGGTTA ACTTCAATCA GGGAAAACAT GAAGAAGTTA GGAAGTTGAT TTATCAGTTG 18 0 

ATAGAGTTAG ATTTTCTGGA ACTTTTGTGA GGGATTCTAT GAAAAACTGG AAGCAATT 23 8 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
AATTCGGCAC GCAGGTTTTC TAAAAAAAGG CCGTTGATGA CTTTGTCGAT ATTGGCGGCT 60 
TCGGTGTAGT GCGCGCCCGC TTCGGCCGCT CTTGCGCGTC CATGACGGAT TGGAAGAGCG 120 
TGC CGAAGAT TTCTGGACTG ATGTTGCGCC AGTCGAAATT GCCGACACGG GAGGAATACC 180 
TGCCAACAAG AGTGCAGGCA GCGTAATCAA ACCACCCCCA CCCGCAATCG CATCGATAAA 240 
TCCGGCAATC ATCGCAACCA AAC CCAAAGC GAGTATTATG TATAAATCTT CCATGTTTCT 3 00 

TAATCCTGTT AACTTGCACC AA 322 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



<iv) ANTISENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 



AATTTGTCGG CAATCTTCCC GGGTCGCTTT ATTTTGTGCA GGCATTATTT TTCATTTTTG 6 0 



GCTTGACAGT TTGGAGATAT TGTGTATCGG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 120 



AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC CTGCGCGGTG 18 0 



ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAAC CTG ATGCCGGCCT GTTGTTGAAT 24 0 



ATTTTCGACC TGTAATTACG ATTTGGCTTC CGCGCCGGCA CAATATGCCG CCAAGCGGCG 3 00 



CCCACATTTT GGAAGC 



316 



(2) INFORMATION FOR SEQ ID NO : 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
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r 

AATTCGGACA GTATGAATAC AGCGGATTAA TACAAGGTAA GTTCATTACA ACGGAAAAAC 60 
CTTTAAAGAA TAATATGAAA GGTATTACCT TGTTTGCCAA CGGGAATGGT AAATATGCCC 120 
GAGTTTTTCA CTGAATAGCG AATCCAGCCA TTTCTATTCA TATTTGACTG GATGGCTGAA 18 0 

TGTGGACTTT ATAGATAATG ACGATGAAGA TTTAATT 217 
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